Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus

Saikat, Abu Saim Mohammad; Das, Ranjit Chandra; Das, Madhab Chandra

doi:10.3390/ecsoc-26-13530

Open AccessProceeding Paper

Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus^†

by

Abu Saim Mohammad Saikat

^1,*

,

Ranjit Chandra Das

²

and

Madhab Chandra Das

³

¹

Department of Biochemistry and Molecular Biology, Life Science Faculty, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Gopalganj 8100, Bangladesh

²

Department of Materials Science and Engineering, Florida State University, Tallahassee, FL 32306, USA

³

Department of Electrical & Electronic Engineering, University of Information Technology and Sciences (UITS), Dhaka 1212, Bangladesh

^*

Author to whom correspondence should be addressed.

^†

Presented at the 26th International Electronic Conference on Synthetic Organic Chemistry, 15–30 November 2022; Available online: https://sciforum.net/event/ecsoc-26.

Chem. Proc. 2022, 12(1), 32; https://doi.org/10.3390/ecsoc-26-13530

Published: 14 November 2022

(This article belongs to the Proceedings of The 26th International Electronic Conference on Synthetic Organic Chemistry)

Download

Browse Figures

Versions Notes

Abstract

:

Throughout history, viral epidemics of varying frequency and intensity have been responsible for inducing panic and causing widespread damage. The Nipah virus has one of the highest rates of fatalities of any infectious disease in the world. There have been cases when severe respiratory distress has resulted in death, and it is known that these cases can cause encephalitis. The appearance of the virus and its ability to spread are affected by several factors. Several strategies have been created to raise awareness about the need for personal hygiene and enhance surveillance within the contaminated zone. This work aimed to determine the characteristics of a previously unidentified protein linked with the fusion of Nipah henipavirus particles. The protein’s secondary structure comprises helix, sheet, turn, and secondary coil structures. The protein is a fusion protein. In addition, the estimated Ramachandran plot provided evidence of the accuracy of the modeled protein structure. This accuracy was then verified by the Z-score-based and local model quality evaluation methods. It is possible to think of the protein as a target for developing prospective therapeutic and vaccine candidates directed against the protein to fight viral infections.

Keywords:

Nipah virus; protein characterization; Protein stability; GRAVY; viral infection

1. Introduction

The Nipah virus (NiV), spread by bats and capable of causing fatal encephalitis in humans, has recently been identified in Malaysia, Bangladesh, Singapore, and India [1,2,3]. It belongs to the order Mononegavirales, which contains other developing lethal zoonotic viruses, including Hendra, Marburg, and Ebola [4]. The virus is thought to be stored naturally in the bodies of Pteropus fruit bats. Humans received NiV from pigs, the intermediate hosts of the virus, in 1998, during the first documented epidemic in the Malaysian town of Sungai [5,6,7]. Since 2001, the intake of raw date palm sap contaminated with the saliva and excreta of bats has been reported as the source of yearly NiV outbreaks in various districts of Bangladesh. The first epidemic in India was recorded in Siliguri, West Bengal, in 2001, and it was mainly spread by intimate personal contact or nosocomial transmission. In 2007, a second outbreak was reported in Nadia and West Bengal [7,8]. In a recent NiV epidemic in the Kozhikode region of Kerala, a state in South India, the index patient was said to have been infected by fruit-eating bats [9]. While nosocomial transmission accounted for the vast majority of cases, no clinical or statistical data was provided to confirm the frequency of the illness. The most recent epidemic in Kerala had a death rate of 91%, which is typical of all outbreaks [9,10].

Cell–cell fusion (syncytia) in lung, brain, kidney, and heart tissues is caused by the Nipah (NiV) and Hendra (HeV) viruses. This results in encephalitis, pneumonia, and frequent deaths. Henipavirus infections are characterized by membrane fusion, which is required for viral entry and virus-induced cell–cell fusion [11,12,13,14]. Understanding the pathobiology of henipaviruses relies on elucidating the mechanism(s) of membrane fusion, which may lead to discovering new approaches to creating antiviral therapeutics. Viral attachment (G) and fusion (F) glycoproteins must work together to facilitate membrane fusion in henipaviruses. Current theories of henipavirus fusion propose that F is released from its metastable pre-fusion conformation to promote membrane fusion after NiV or HeV G attachment to its cell surface receptors [11,15,16,17,18]. The selected protein for this study is a fusion protein of Nipah henipavirus, which is associated with viral infections. The physicochemical characteristics and anticipated protein structures of the selected protein demonstrated structure-function relationships of the proteins associated with viral infections. Therefore, this protein can be targeted for predicting antiviral drugs and vaccines against the selected protein to combat viral infections.

2. Materials and Methods

2.1. Protein Sequence Retrieval

The protein sequence (GenBank: QBQ56722.1, NCBI accession: QBQ56722) was retrieved in FASTA format from the NCBI protein sequence database [19].

2.2. Identification of the Physicochemical Properties

The physicochemical characteristics of the protein were demonstrated by using the ExPASy ProtParam tool [20] and the SMS (v.2.0) program [21].

2.3. Secondary Structure Identification and Assessment of the Selected Protein

The SOPMA program [22] was used following the default parameters (output width = 8; the number of conformational states = 4; helix, sheet, turn, and coil; similarity threshold = 8; and window width = 17) to determine the secondary structural parameters. Moreover, the SPIPRED program (v.4.0) [23] was used for the determination of the secondary features and topology of the selected protein.

2.4. Determination and Validation of the Three-Dimensional Protein Structure

The three-dimensional structure of the selected protein was anticipated by using the Modeller [24] with the HHpred interface [25,26]. Moreover, the PROCHECK program of the SAVES program (v.6.0) [27] was used for the structural validation of the modeled 3D structure of the protein. Additionally, the ProSA-web program [28] was used to determine the Z-score of the modeled structure for structural assessment.

3. Results and Discussion

3.1. Sequence Retrieval of the Selected Protein

The protein sequence retrieved from the NCBI database contains 546 amino acid residues (Table 1). The fusion protein (accession no. QBQ56722, version no. QBQ56722.1) is found in the QBQ56722 locus of Nipah henipavirus.

3.2. Physicochemical Parameters Determination of the Selected Protein

The physicochemical parameters of a protein are defined by the characteristics of its constituent amino acids. The alpha-carbon unit of all amino acids, except for glycine, is asymmetric, indicating that it is connected to four distinct chemical constituents (atoms or atom pairs) [29,30]. Consequently, amino acids, except glycine, can appear in two distinct spatial or geometric configurations (i.e., isomers), which resemble the left and right hands [31,32,33]. The ExPASy ProtParam tool identified the physicochemical characteristics of the protein, such as amino acid compositions, atomic compositions, and protein half-life calculations (Figure 1). Leucine is the most abundant amino acid (61, 11.2%) compared to others in the amino acid sequence. Moreover, the atomic composition of the protein demonstrated that hydrogen is the most abundant element (4361, 50.8%), followed by oxygen (817, 9.5%), nitrogen (693, 8.1%), and sulfur (26, 0.3%).

The protein has a molecular weight of about 60,280.90 Da (Table 2) with a theoretical pI of 6.08 (6.30*). The protein has the total number of positively charged residues (Arg + Lys), the whole number of atoms, and the absolute number of negatively charged residues (Asp + Glu) as of 46, 8584, and 48, respectively. As more protein therapies are being developed, many of which have a short plasma half-life, the biotech and pharmaceutical industries are focusing more and more on methods to lengthen that half-life [34,35]. The therapeutic and cost benefits of a longer half-life are apparent. Numerous recognized or in-development biotherapeutics have a short half-life, needing numerous administrations to sustain a therapeutic level over a long period [36,37,38]. The use of half-life extension techniques permits the production of medicines with enhanced pharmacokinetic and pharmacodynamic characteristics that have a prolonged half-life. Incorporating half-life extension methods into the development of numerous biotherapeutics is now standard practice. Various options are available for fine-tuning the half-life and adaptation to the desired treatment method and condition [39,40,41,42]. The anticipated protein half-life is 30 h (mammalian reticulocytes, in vitro); >20 h (yeast, in vivo); and >10 h (Escherichia coli, in vivo).

Efforts are undertaken to establish a relationship between the metabolic stability of proteins and aspects of their primary sequence and to use weight estimates of instability for a protein with an established sequence to determine its resilience properties [43,44,45,46]. Proteins may be evaluated for viability in vitro using the “Instability Index.” If the index is under 40, the substance will likely be stable in the test tube. It is presumably not sustainable if it is more significant [47,48,49]. The instability index of the selected protein is 38.05 (less than 40.00), resulting in a stable nature. The aliphatic index measures how much space is taken up by a protein’s aliphatic side chains compared to its total volume [50]. The thermal stability of proteins is related to their aliphatic index. Proteins with a high aliphatic index are less likely to denature when heated. Hydrophobicity is a property shared by aliphatic amino acids [50,51,52]. The aliphatic index of the selected protein is demonstrated as 112.27. GRAVY is the value employed to demonstrate a protein’s hydrophobicity. This value is computed by accepting the absolute hydropathy values of all amino acids (aa) and splitting that whole by the entire sequence length [53,54,55,56]. The estimated GRAVY of the protein is 0.177.

3.3. Identification and Validation of the Predicted Secondary Structure of the Selected Protein

In the context of a polypeptide chain, the term “secondary structure” refers to the standard and recurrent spatial configurations of neighboring amino acid residues. Hydrogen bonds between amide hydrogens as well as carbonyl oxygens in the peptide backbone are responsible for its stability. Alpha-helices (α-helices) and beta-structures (β-structures) are the two most important types of secondary structures [57,58,59]. The SOPMA program demonstrated that the protein contains an alpha helix (239, 43.77%), an extended strand (112, 0.51%), a beta turn (23, 4.21%), and a random coil (172, 31.50%). No Pi helix, beta bridge, bend region, and ambiguous states were present in the protein (Figure 2). The selected protein contains polar, non-polar, aromatic group-containing, and hydrophobic amino acid residues in its structure (Figure 3). Moreover, the sequence plot demonstrated the protein parameters, including the protein’s helical, coil, and extracellular properties (Figure 3). The secondary structure of the selected protein is illustrated in Figure 4.

3.4. The Three-Dimensional Protein Structure Anticipation and Assessment

The three-dimensional form of a protein is known as its tertiary structure. One primary “backbone” polypeptide chain in the tertiary structure comprises one or more protein secondary structures (PSSs) called domains [60,61,62]. There are a variety of possible interactions and bonds between amino acid side chains. The sequence-structure gap (SSG) is a significant obstacle in computational biology and chemistry, and protein structure anticipation is one strategy to close this gap. Accurately predicting the structure of a protein is critical since protein structure dictates its function [60,63,64]. The most favored protein template (HHpred ID: 2B9B_A) was selected for anticipation of the three-dimensional protein structure by the Modeller program with the HHpred interface with a probability of 100%, an E-value of 2.8 × 10⁻¹³², and a target length of 497 (Figure 5).

The estimated Ramachandran plot calculations of the selected protein were as follows: residues in most favored regions (411, 91.9%); residues in additional allowed regions (30, 6.7%); residues in generously allowed regions (6, 1.3%); number of non-glycine and non-proline residues (447, 100.0%); and there was no residue in disallowed regions (Figure 5). Moreover, the local model assessment and the overall model quality by Z-score (−7.26) assessed the anticipated protein model quality and validated the structure of the protein.

4. Conclusions

NiV has developed into a fatal zoonotic disease. Bats, the natural reservoir of the virus, are adept at viral propagation, and human outbreaks continue to be documented routinely. Since bats may be found worldwide, we might expect to see new epidemics in previously unaffected regions. Acute illness progression and a high death rate make a correct diagnosis challenging. The absence of accessible, affordable diagnostic tests and laboratories to process viral samples makes the situation worse. The total caseload is low, and the course of infection is rapid. Thus, there is a dearth of investigations into human subjects that might yield effective therapy and prevention. The selected protein’s secondary and tertiary characteristics demonstrated the protein structure-based relationships and, therefore, more comprehensive properties of the protein. The protein is a fusion protein deeply associated with viral infection. Therefore, the selected protein can be a target for both protein-based drug and vaccine design against the protein to minimize viral infections.

Author Contributions

Conceptualization, A.S.M.S.; methodology, A.S.M.S.; software, A.S.M.S.; validation, A.S.M.S.; formal analysis, A.S.M.S.; investigation, A.S.M.S.; resources, A.S.M.S.; formal analysis, A.S.M.S.; investigation, A.S.M.S.; resources, A.S.M.S.; writing—original draft preparation, A.S.M.S.; writing—review and editing, A.S.M.S., R.C.D. and M.C.D.; visualization, A.S.M.S.; supervision, A.S.M.S.; project administration, A.S.M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The author would like to acknowledge the preprint version of the manuscript available at ChemRxiv (DOI:10.26434/chemrxiv-2022-mcm52).

Conflicts of Interest

The author declares no conflict of interest.

References

Goh, K.J.; Tan, C.T.; Chew, N.K.; Tan, P.S.K.; Kamarulzaman, A.; Sarji, S.A.; Wong, K.T.; Abdullah, B.J.J.; Chua, K.B.; Lam, S.K. Clinical features of Nipah virus encephalitis among pig farmers in Malaysia. N. Engl. J. Med. 2000, 342, 1229–1235. [Google Scholar] [CrossRef] [PubMed]
Rahman, M.; Chakraborty, A. Chakraborty, Nipah virus outbreaks in Bangladesh: A deadly infectious disease. WHO South East Asia J. Public Health 2012, 1, 208–212. [Google Scholar] [CrossRef]
Chadha, M.S.; Comer, J.A.; Lowe, L.; Rota, P.A.; Rollin, P.; Bellini, W.J.; Ksiazek, T.G.; Mishra, A.C. Nipah virus-associated encephalitis outbreak, Siliguri, India. Emerg. Infect. Dis. 2006, 12, 235–240. [Google Scholar]
Amarasinghe, G.K.; Ayllón, M.A.; Bào, Y.; Basler, C.F.; Bavari, S.; Blasdell, K.R.; Briese, T.; Brown, P.A.; Bukreyev, A.; Balkema-Buschmann, A.; et al. Taxonomy of the order Mononegavirales: Update 2019. Arch. Virol. 2019, 164, 1967–1980. [Google Scholar] [CrossRef]
Halpin, K.; Hyatt, A.D.; Fogarty, R.; Middleton, D.; Bingham, J.; Epstein, J.H.; Rahman, S.A.; Hughes, T.; Smith, C.; Field, H.E.; et al. Pteropid bats are confirmed as the reservoir hosts of henipaviruses: A comprehensive experimental study of virus transmission. Am. J. Trop. Med. Hyg. 2011, 85, 946–951. [Google Scholar] [CrossRef] [PubMed]
Epstein, J.H.; Field, H.E.; Luby, S.; Pulliam, J.R.; Daszak, P. Nipah virus: Impact, origins, and causes of emergence. Curr. Infect. Dis. Rep. 2006, 8, 59–65. [Google Scholar] [CrossRef] [PubMed]
Banerjee, S.; Gupta, N.; Kodan, P.; Mittal, A.; Ray, Y.; Nischal, N.; Soneja, M.; Biswas, A.; Wig, N. Nipah virus disease: A rare and intractable disease. Intractable Rare Dis. Res. 2019, 8, 1–8. [Google Scholar] [CrossRef]
Luby, S.P.; Rahman, M.; Hossain, M.J.; Blum, L.S.; Husain, M.M.; Gurley, E.; Khan, R.; Ahmed, B.N.; Rahman, S.; Nahar, N.; et al. Foodborne transmission of Nipah virus, Bangladesh. Emerg. Infect. Dis. 2006, 12, 1888–1894. [Google Scholar] [CrossRef] [PubMed]
Arunkumar, G.; Chandni, R.; Mourya, D.T.; Singh, S.K.; Sadanandan, R.; Sudan, P.; Bhargava, B. Outbreak Investigation of Nipah Virus Disease in Kerala, India, 2018. J. Infect. Dis. 2019, 219, 1867–1878. [Google Scholar] [CrossRef]
Pillai, V.S.; Krishna, G.; Veettil, M.V. Nipah Virus: Past Outbreaks and Future Containment. Viruses 2020, 12, 465. [Google Scholar] [CrossRef]
Aguilar, H.C.; Iorio, R.M. Henipavirus membrane fusion and viral entry. Curr. Top. Microbiol. Immunol. 2012, 359, 79–94. [Google Scholar] [PubMed]
Ang, B.S.; Lim, T.C.; Wang, L. Nipah Virus Infection. J. Clin. Microbiol. 2018, 56, e01875-17. [Google Scholar] [CrossRef] [PubMed]
Paul, L. Nipah virus in Kerala: A deadly Zoonosis. Clin. Microbiol. Infect. 2018, 24, 1113–1114. [Google Scholar] [CrossRef] [PubMed]
Hickey, C.A.; Broder, C.C. The mechanism of henipavirus fusion: Examining the relationships between the attachment and fusion glycoproteins. Virol. Sin. 2009, 24, 110–120. [Google Scholar] [CrossRef]
Lee, B.; Ataman, Z.A. Modes of paramyxovirus fusion: A Henipavirus perspective. Trends Microbiol. 2011, 19, 89–399. [Google Scholar] [CrossRef]
Steffen, D.L.; Xu, K.; Nikolov, D.B.; Broder, C.C. Henipavirus mediated membrane fusion, virus entry and targeted therapeutics. Viruses 2012, 4, 280–308. [Google Scholar] [CrossRef] [PubMed]
Yan, D.; Wang, Z. Research Progress in Enveloped Glycoproteins and the Membrane-fusion Mechanism of Nipah Virus. Bing Du Xue Bao 2016, 32, 361–368. [Google Scholar] [PubMed]
Mathieu, C.; Horvat, B. Henipavirus pathogenesis and antiviral approaches. Expert Rev. Anti Infect. Ther. 2015, 13, 343–354. [Google Scholar] [CrossRef]
Sayers, E.W.; Beck, J.; E Bolton, E.; Bourexis, D.; Brister, J.R.; Canese, K.; Comeau, D.C.; Funk, K.; Kim, S.; Klimke, W.; et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2021, 49, D10–D17. [Google Scholar] [CrossRef]
Schneider, M.; Tognolli, M.; Bairoch, A. The Swiss-Prot protein knowledgebase and ExPASy: Providing the plant community with high quality proteomic data and tools. Plant Physiol. Biochem. 2004, 42, 1013–1021. [Google Scholar] [CrossRef]
Stothard, P. The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and DNA sequences. Biotechniques 2000, 28, 1102–1104. [Google Scholar] [CrossRef] [PubMed]
Geourjon, C.; Deléage, G. SOPMA: Significant improvements in protein secondary structure prediction by consensus prediction from multiple alignments. Bioinformatics 1995, 11, 681–684. [Google Scholar] [CrossRef] [PubMed]
Moffat, L.; Jones, D.T. Increasing the accuracy of single sequence prediction methods using a deep semi-supervised learning framework. Bioinformatics 2021, 37, 3744–3751. [Google Scholar] [CrossRef] [PubMed]
Webb, B.; Sali, A. Comparative Protein Structure Modeling Using MODELLER. Curr. Protoc. Bioinform. 2016, 54, 5.6.1–5.6.37. [Google Scholar] [CrossRef] [PubMed]
Levine, T.P.; Daniels, R.D.; Wong, L.; Gatta, A.T.; Gerondopoulos, A.; Barr, F. Discovery of new Longin and Roadblock domains that form platforms for small GTPases in Ragulator and TRAPP-II. Small GTPases 2013, 4, 62–69. [Google Scholar] [CrossRef]
Zimmermann, L.; Stephens, A.; Nam, S.-Z.; Rau, D.; Kübler, J.; Lozajic, M.; Gabler, F.; Söding, J.; Lupas, A.N.; Alva, V. A Completely Reimplemented MPI Bioinformatics Toolkit with a New HHpred Server at its Core. J. Mol. Biol. 2018, 430, 2237–2243. [Google Scholar] [CrossRef]
Laskowski, R.A.; Rullmann, J.A.C.; MacArthur, M.W.; Kaptein, R.; Thornton, J.M. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 1996, 8, 477–486. [Google Scholar] [CrossRef]
Wiederstein, M.; Sippl, M.J. ProSA-web: Interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 2007, 35 (Suppl. S2), W407–W410. [Google Scholar] [CrossRef]
Tran, T.L.N.; Miranda, A.F.; Mouradov, A.; Adhikari, B. Physicochemical Characteristics of Protein Isolated from Thraustochytrid Oilcake. Foods 2020, 9, 779. [Google Scholar] [CrossRef]
Dey, D.; Biswas, P.; Paul, P.; Mahmud, S.; Ema, T.I.; Khan, A.A.; Ahmed, S.Z.; Hasan, M.M.; Saikat, A.S.M.; Fatema, B.; et al. Natural flavonoids effectively block the CD81 receptor of hepatocytes and inhibit HCV infection: A computational drug development approach. Mol. Divers. 2022. [Google Scholar] [CrossRef]
Lu, Z.X.; He, J.F.; Zhang, Y.C.; Bing, D.J. Composition, physicochemical properties of pea protein and its application in functional foods. Crit. Rev. Food Sci. Nutr. 2020, 60, 2593–2605. [Google Scholar] [CrossRef] [PubMed]
Karadag, M.; Arslan, M.; Kaleli, N.E.; Kalyoncu, S. Physicochemical determinants of antibody-protein interactions. Adv. Protein. Chem. Struct Biol. 2020, 121, 85–114. [Google Scholar]
Khan, R.A.; Hossain, R.; Siyadatpanah, A.; Al-Khafaji, K.; Khalipha, A.B.R.; Dey, D.; Asha, U.H.; Biswas, P.; Saikat, A.S.M.; Chenari, H.A.; et al. Diterpenes/Diterpenoids and Their Derivatives as Potential Bioactive Leads against Dengue Virus: A Computational and Network Pharmacology Study. Molecules 2021, 26, 6821. [Google Scholar] [CrossRef]
Kontermann, R.E. Half-life extended biotherapeutics. Expert Opin. Biol. Ther. 2016, 16, 903–915. [Google Scholar] [CrossRef]
Sleep, D. Albumin and its application in drug delivery. Expert Opin. Drug Deliv. 2015, 12, 793–812. [Google Scholar] [CrossRef]
Kontermann, R.E. Strategies for extended serum half-life of protein therapeutics. Curr. Opin. Biotechnol. 2011, 22, 868–876. [Google Scholar] [CrossRef] [PubMed]
Saikat, A.S.M. An In Silico Approach for Potential Natural Compounds as Inhibitors of Protein CDK1/Cks2. Chem. Proc. 2022, 8, 5. [Google Scholar]
Zaman, R.; Islam, R.A.; Ibnat, N.; Othman, I.; Zaini, A.; Lee, C.Y.; Chowdhury, E.H. Current strategies in extending half-lives of therapeutic proteins. J. Control. Release 2019, 301, 176–189. [Google Scholar] [CrossRef] [PubMed]
Louw, A. GR Dimerization and the Impact of GR Dimerization on GR Protein Stability and Half-Life. Front. Immunol. 2019, 10, 1693. [Google Scholar] [CrossRef]
Werle, M.; Bernkop-Schnürch, A. Strategies to improve plasma half life time of peptide and protein drugs. Amino. Acids 2006, 30, 351–367. [Google Scholar] [CrossRef]
Podust, V.N.; Balan, S.; Sim, B.-C.; Coyle, M.P.; Ernst, U.; Peters, R.T.; Schellenberger, V. Extension of in vivo half-life of biologically active molecules by XTEN protein polymers. J. Control. Release 2016, 240, 52–66. [Google Scholar] [CrossRef]
Rajib, H.; Islam; Torequl, M.; Pranta, R.; Divya, J.; Mohammad, S.A.S.; Lutfun, N.; Das, T.A.; Satyajit, S.; Abdulmajid, A.S.; et al. Amentoflavone, New Hope against SARS-CoV-2: An Outlook through its Scientific Records and an in silico Study. Pharmacogn. Res. 2021, 13, 149–157. [Google Scholar]
Niu, X.; Li, N.; Chen, D.; Wang, Z. Interconnection between the protein solubility and amino acid and dipeptide compositions. Protein Pept. Lett. 2013, 20, 88–95. [Google Scholar] [CrossRef]
Huang, H.-L.; Charoenkwan, P.; Kao, T.-F.; Lee, H.-C.; Chang, F.-L.; Huang, W.-L.; Ho, S.-J.; Shu, L.-S.; Chen, W.-L. Prediction and analysis of protein solubility using a novel scoring card method with dipeptide composition. BMC Bioinform. 2012, 13 (Suppl. S17), S3. [Google Scholar] [CrossRef] [PubMed]
Saikat, A.S.M.; Uddin, E.; Ahmad, T.; Mahmud, S.; Imran, A.S.; Ahmed, S.; Alyami, S.A.; Moni, M.A. Structural and Functional Elucidation of IF-3 Protein of Chloroflexus aurantiacus Involved in Protein Biosynthesis: An In Silico Approach. BioMed Res. Int. 2021, 2021, 9050026. [Google Scholar] [CrossRef] [PubMed]
Yagasaki, M.; Hashimoto, S. Synthesis and application of dipeptides; current status and perspectives. Appl. Microbiol. Biotechnol. 2008, 81, 13–22. [Google Scholar] [CrossRef] [PubMed]
Guruprasad, K.; Reddy, B.V.; Pandit, M.W. Correlation between stability of a protein and its dipeptide composition: A novel approach for predicting in vivo stability of a protein from its primary sequence. Protein Eng. 1990, 4, 155–161. [Google Scholar] [CrossRef] [PubMed]
Saikat, A.S.M.; Islam, R.; Mahmud, S.; Imran, M.A.S.; Alam, M.S.; Masud, M.H.; Uddin, M.E. Structural and Functional Annotation of Uncharacterized Protein NCGM946K2_146 of Mycobacterium Tuberculosis: An In-Silico Approach. Proceedings 2020, 66, 13. [Google Scholar]
Gamage, D.G.; Gunaratne, A.; Periyannan, G.; Russell, T.G. Applicability of Instability Index for In vitro Protein Stability Prediction. Protein Pept. Lett. 2019, 26, 339–347. [Google Scholar] [CrossRef]
Panda, S.; Chandra, G. Physicochemical characterization and functional analysis of some snake venom toxin proteins and related non-toxin proteins of other chordates. Bioinformation 2012, 8, 891–896. [Google Scholar] [CrossRef]
Ikai, A. Thermostability and aliphatic index of globular proteins. J. Biochem. 1980, 88, 1895–1898. [Google Scholar] [PubMed]
Enany, S. Structural and functional analysis of hypothetical and conserved proteins of Clostridium tetani. J. Infect. Public Health 2014, 7, 296–307. [Google Scholar] [CrossRef]
Chang, Y.K.; Yang, J.R. Analysis and prediction of highly effective antiviral peptides based on random forests. PLoS ONE 2013, 8, e70166. [Google Scholar] [CrossRef] [PubMed]
Kyte, J.; Doolittle, R.F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982, 157, 105–132. [Google Scholar] [CrossRef] [PubMed]
Thomas, S.; Karnik, S.; Barai, R.; Jayaraman, V.K.; Idicula-Thomas, S. CAMP: A useful resource for research on antimicrobial peptides. Nucleic Acids Res. 2010, 38, D774–80. [Google Scholar] [CrossRef]
Hossain, R.; Quispe, C.; Saikat, A.S.M.; Jain, D.; Habib, A.; Janmeda, P.; Islam, M.T.; Radha, U.; Daştan, S.D.; Kumar, M.; et al. Biosynthesis of Secondary Metabolites Based on the Regulation of MicroRNAs. Biomed Res. Int. 2022, 2022, 9349897. [Google Scholar] [CrossRef]
Gromiha, M.M. Chapter 1—Proteins. In Protein Bioinformatics; Gromiha, M.M., Ed.; Academic Press: Singapore, 2010; pp. 1–27. [Google Scholar]
Wardah, W.; Khan, M.; Sharma, A.; Rashid, M. Protein secondary structure prediction using neural networks and deep learning: A review. Comput. Biol. Chem. 2019, 81, 1–8. [Google Scholar] [CrossRef]
Koch, I.; Schäfer, T. Protein super-secondary structure and quaternary structure topology: Theoretical description and application. Curr. Opin. Struct. Biol. 2018, 50, 134–143. [Google Scholar] [CrossRef]
Jisna, A.V.; Jayaraj, P.B. Protein Structure Prediction: Conventional and Deep Learning Perspectives. Protein J. 2021, 40, 522–544. [Google Scholar] [CrossRef]
Hou, J.; Wu, T.; Cao, R.; Cheng, J. Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13. Proteins 2019, 87, 1165–1178. [Google Scholar] [CrossRef]
Tamburrini, K.C.; Pesce, G.; Nilsson, J.; Gondelaud, F.; Kajava, A.V.; Berrin, J.-G.; Longhi, S. Predicting Protein Conformational Disorder and Disordered Binding Sites. Methods Mol. Biol. 2022, 2449, 95–147. [Google Scholar] [PubMed]
Reinert, E.Z.; Horne, W.S. Protein backbone engineering as a strategy to advance foldamers toward the frontier of protein-like tertiary structure. Org. Biomol. Chem. 2014, 12, 8796–8802. [Google Scholar] [CrossRef] [PubMed]
Shimizu, K.; Cao, W.; Saad, G.; Shoji, M.; Terada, T. Comparative analysis of membrane protein structure databases. Biochim. Biophys. Acta Biomembr. 2018, 1860, 1077–1091. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Physicochemical parameters of the selected protein. (a) The protein contains Ala (28, 5.1%), Arg (18, 3.3%), Asn (36, 6.6%), Asp (23, 4.2%), Cys (12, 2.2%), Gln (22, 4.0%), Glu (25, 4.6%), Gly (30, 5.5%), His (3, 0.5%), Ile (60, 11.0%), Leu (61, 11.2%), Lys (28, 5.1%), Met (14, 2.6%), Phe (13, 2.4%), Pro (17, 3.1%), Ser (51, 9.3%), Thr (40, 7.3%), Trp (1, 0.2%), Tyr (25, 4.6%), and Val (39, 7.1%). (b) The atomic composition of the protein as of carbon (2687, 31.3%), hydrogen (4361, 50.8%), nitrogen (693, 8.1%), oxygen (817, 9.5%), and sulfur (26, 0.3%).

Figure 2. Secondary structural characteristics of the selected protein. The secondary parameters of the selected protein determined the alpha helix (blue color), extended strand (red color), beta-turn (light-green color), and random coil ( purple color).

Figure 3. (a) Amino acid types of the selected protein, and (b) Sequence plot of the selected protein.

Figure 4. The secondary structure of the selected protein.

Figure 5. Tertiary structural anticipation and assessment of the selected protein. (a) The predicted three-dimensional structure, (b) The overall model quality by Z-score (−7.26), (c) The local model quality assessment, and (d) Ramachandran plot statistics obtained from the SAVES program.

Table 1. Protein retrieval.

Protein Individualities	Protein Information
Locus	QBQ56722
Amino acid	546 aa
Accession	QBQ56722
Version	QBQ56722.1
GenBank ID	QBQ56722.1
Source	Nipah henipavirus
Organism	Nipah henipavirus
FASTA sequence	>QBQ56722.1 fusion protein [Nipah henipavirus] MAVILNKRYYSNLLLLILMISECSVGILHYEKLSKIGLVKGITRKYKIKSNPLTKDIVIKMIPNVSNMSQCTGSVMENYKTRLNGILTPIKGALEIYKNNTHDLVGDVRLAGVIMAGVAIGIATAAQITAGVALYEAMKNADNINKLKSSIESTNEAVVKLQETAEKTVYVLTALQDYINTNLVPTIDKISCKQTELSLDLALSKYLSDLLFVFGPNLQDPVSNSMTIQAISQAFGGNYETLLRTLGYATEDFDDLLESDSITGQIIYVDLSGYYIIVRVYFPILTEIQQAYIQELLPVSFNNDNSEWISIVPNFILVRNTLISNIEIGFCLITKRSVICNQDYATPMTNNMRECLTGSTEKCPRELVVSSHVPRFALSNGVLFANCISVTCQCQTTGRAISQSGEQTLLMIDNTTCPTAVLGNVIISLGKYLGSVNYNSEGIAIGPPVFTDKVDISSQISSMNQSLQQSKDYIKEAQRLLDTVNPSLISMLSMIILYVLSIASLCIGLITFISFIIVEKKRNTYSRLEDRRVRPTSSGDLYYIGT

Table 2. Physicochemical parameters of the selected protein.

Parameters	Values
Molecular weight	60,280.90 Da
Theoretical pI	6.08 (6.30 *)
Total number of positively charged residues (Arg + Lys)	46
Total number of negatively charged residues (Asp + Glu)	48
Total number of atoms	8584
Estimated half-life	(a) 30 h (mammalian reticulocytes, in vitro) (b) >20 h (yeast, in vivo) (c) >10 h (Escherichia coli, in vivo)
Instability index (II)	38.05
Aliphatic index	112.27
Grand average of hydropathicity (GRAVY)	0.177

* pI calculated by the SMS v2.0 tool.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Saikat, A.S.M.; Das, R.C.; Das, M.C. Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus. Chem. Proc. 2022, 12, 32. https://doi.org/10.3390/ecsoc-26-13530

AMA Style

Saikat ASM, Das RC, Das MC. Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus. Chemistry Proceedings. 2022; 12(1):32. https://doi.org/10.3390/ecsoc-26-13530

Chicago/Turabian Style

Saikat, Abu Saim Mohammad, Ranjit Chandra Das, and Madhab Chandra Das. 2022. "Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus" Chemistry Proceedings 12, no. 1: 32. https://doi.org/10.3390/ecsoc-26-13530

Article Menu

Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus^†

Abstract

1. Introduction

2. Materials and Methods

2.1. Protein Sequence Retrieval

2.2. Identification of the Physicochemical Properties

2.3. Secondary Structure Identification and Assessment of the Selected Protein

2.4. Determination and Validation of the Three-Dimensional Protein Structure

3. Results and Discussion

3.1. Sequence Retrieval of the Selected Protein

3.2. Physicochemical Parameters Determination of the Selected Protein

3.3. Identification and Validation of the Predicted Secondary Structure of the Selected Protein

3.4. The Three-Dimensional Protein Structure Anticipation and Assessment

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus †

Abstract

1. Introduction

2. Materials and Methods

2.1. Protein Sequence Retrieval

2.2. Identification of the Physicochemical Properties

2.3. Secondary Structure Identification and Assessment of the Selected Protein

2.4. Determination and Validation of the Three-Dimensional Protein Structure

3. Results and Discussion

3.1. Sequence Retrieval of the Selected Protein

3.2. Physicochemical Parameters Determination of the Selected Protein

3.3. Identification and Validation of the Predicted Secondary Structure of the Selected Protein

3.4. The Three-Dimensional Protein Structure Anticipation and Assessment

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Computational Approaches for Structure-Based Molecular Characterization and Functional Annotation of the Fusion Protein of Nipah henipavirus^†