Possibility of estimating future mutants for influenza: Comparison between previous prediction and subsequent years observation

Mao, Tiantian; Yan, Deyu; Zhou, Mengdi; Qiu, Tianyi; Cao, Zhiwei

doi:10.3389/fmicb.2022.1031672

OPINION article

Front. Microbiol., 05 October 2022

Sec. Evolutionary and Genomic Microbiology

Volume 13 - 2022 | https://doi.org/10.3389/fmicb.2022.1031672

This article is part of the Research Topic Phylogenetics in the One Health Context View all 8 articles

Possibility of estimating future mutants for influenza: Comparison between previous prediction and subsequent years observation

$\r\nTiantian Mao$ Tiantian Mao¹

Deyu Yan¹

Mengdi Zhou¹

Tianyi Qiu^2*

Zhiwei Cao^1,3*

¹Department of Gastroenterology, Shanghai Tenth People’s Hospital, School of Life Sciences and Technology, Tongji University, Shanghai, China
²Institute of Clinical Science, Zhongshan Hospital, Shanghai Medical College, Fudan University, Shanghai, China
³School of Life Sciences, Fudan University, Shanghai, China

Introduction

Despite less cases being reported in recent years, the seasonal influenza remains a concern because of constant genomic mutation and potential antigenic evasion, particularly in surface hemagglutinin (HA) (Petrova and Russell, 2018). Though World Health Organization makes periodic surveillance on circulating strains and recommends the next vaccine components when new variants are detected, months are often needed for vaccine manufacturing, packaging, and distribution (Sabbaghi et al., 2019). This may sometimes cause mismatch between vaccinated components and circulating ones hitting months later, leading to occasional vaccine failure or reduced effectiveness (Poland, 2018). While the long-term and broadly protection vaccines are explored, predicting the future mutational spectrum may be helpful for better preparation and protection.

In recent years, some interesting work tried in this direction with encouraging results obtained (Salama et al., 2016; Yin et al., 2020). In 2016, we proposed a “mutation-selection-ranking” strategy to explore the possibility to pre-estimate the single-site mutational profiles in influenza HA antigenic sites and gave a prediction list for A/H1N1 (Xu et al., 2016). The model was built on 5 well-established antigenic regions (AR) via data between 1918 and 1998. Independent validation on data between 1999 and 2014 showed that over 94% of the newly emerged antigenic sites were covered by predicted profile. At last, based on the template strain of A/H1N1 (A/Alberta/47/2014), a prediction list of future variants was also published at the end of the article, which now has become an ideal test to re-evaluate the prediction possibility. Here, we collected the observed HA mutants before the SARS-CoV-2 outbreak from 2015 to 2019 for annual comparison with the predicted list.

High agreement between prediction and observation by retrospective comparison

According to the Supplementary Table 4 in previous article (Xu et al., 2016), candidate sequences were predicted for 5 ARs, with 20 Ca1, 20 Ca2, 10 Cb, 50 Sa, and 50 Sb variants, respectively. In terms of validation, the three parameters were again taken as previously introduced, covering the type coverage, strain coverage, and the ranking list of mutants in each AR. Basically, the high strain coverage, low type coverage, and top-ranking indicate the successful capture of the future dominant mutants from a large number of observed variants (see Supplementary Methods).

Annual checking was done with results shown in Figure 1A. Firstly, the number of mutant types in each AR fluctuates over time, with a range of 5–30 variants each year. In the subsequent 5 years, the averaged strain and type coverage were seen as 96.5 and 44.8% for Ca1, 98.5 and 50.7% for Ca2, 98.7 and 17.8% for Cb, 97.6 and 56.6% for Sb, and 47.8 and 19.1% for SA (Supplementary Table 1). More importantly, the sharp contrast between low type coverage and high strain coverage suggested the ability to capture those dominant strains among offspring variants on most ARs. It was also noteworthy that the strain coverage of Sa dropped rapidly after 3 years and lost predictive performance afterward. This may be related with the long interval from the template being adopted, where Sa region might mutate at a different rate from others.

FIGURE 1

Figure 1. Prediction performance of A/H1N1 during 2015–2019. (A) Type coverage and strain coverage of five ARs. Each bar shows the total number of unique AR sequences observed in the circulation year, with the colored portion indicating the number of successfully predicted ARs, corresponding to the left axis. The top line indicates the strain coverage for each year, corresponding to the right axis. (B) The ranking in the prediction list for those top 3 dominant ARs observed every year. - indicates the observed variants didn’t be detected in the prediction list, NA indicates that less than 3 dominant variants with abundancy above 5% were observed that year.

In addition, the ranking of the observed dominant mutants was also checked in the prediction list. In Figure 1B, those top circulating mutants in each AR were successfully short-listed by the prediction. Most of the time, the top 1 prevalent was forecasted as the top 1, which is particularly obvious after 2 years of template selection. For example, in 2015 and 2016, all the top 1 community-prevalent variants were predicted as exactly the top 1 excepting Sa region. Surprisingly, in 2015, all the newly emerged sub-dominant mutants (top 2) circulating in the community were successfully predicted as top 1 to top 7. In other words, the to-be-emerged mutants seemed to be foreseen 1 year before circulation. Though the model performance dropped along the time as more mutants emerged, above 5-year comparison confirmed the high possibility of forecasting future dominant mutants, at least within the subsequent 1–2 years.

Discussion

In summary, predicting the future mutational profiles is highly desirable but challenging for rapidly mutating RNA viruses, such as influenza. Here, we compared a prediction list in 2014 with the subsequent 5-year reported variants, and the results highlighted the possibility to predict future mutants for influenza. In addition, we provided another prediction list of A/H1N1 for future seasons (Supplementary Table 2) based on the template of A/Pennsylvania/02/2021, and a prediction list of A/H3N2 (Supplementary Table 3) based on the template of A/Maryland/12239/2021, respectively. Interestingly, multiple mutations were detected involving significant property changes which might cause antigenic variations. For instance, H141Y in Ca2 (rank 7), S74F in Cb (rank 11), K163N in Sa (rank 3) for A/H1N1, or K131E in A (rank 3), F174L in D (rank 2), and so on for A/H3N2. Of note, the emergence of SARS-CoV-2 in 2020, may have dramatically affected the evolution of many respiratory viruses, including influenza virus. The model performance based on pre-SARS-CoV-2 data remains to be tested in the future. Also, current results only apply to single-site mutations in the HA ARs, while predicting the exact mutations at the strain level remains to be explored in the future. At last, though this strategy might theoretically be extended to other viruses, the feasibility needs to be re-evaluated for those newly emerged pathogens like SARS-CoV-2.

Author contributions

ZC and TQ conceived and designed the experiments and supervised the whole project. TM constructed the model and performed the data analysis. DY collected the data. MZ helped to analyze the results. TM, TQ, and ZC wrote the manuscript. All authors reviewed and approved the manuscript.

Funding

This work was supported by the National Key R&D Program of China (2019YFA0905900) and the National Natural Science Foundation of China (32070657, 81830080, and 31900483).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb.2022.1031672/full#supplementary-material

References

Petrova, V. N., and Russell, C. A. (2018). The evolution of seasonal influenza viruses. Nat. Rev. Microbiol. 16, 47–60. doi: 10.1038/nrmicro.2017.118

PubMed Abstract | CrossRef Full Text | Google Scholar

Poland, G. A. (2018). Influenza vaccine failure: failure to protect or failure to understand? Expert. Rev. Vaccines 17, 495–502. doi: 10.1080/14760584.2018.1484284

PubMed Abstract | CrossRef Full Text | Google Scholar

Sabbaghi, A., Miri, S. M., Keshavarz, M., Zargar, M., and Ghaemi, A. (2019). Inactivation methods for whole influenza vaccine production. Rev. Med. Virol. 29:e2074. doi: 10.1002/rmv.2074

PubMed Abstract | CrossRef Full Text | Google Scholar

Salama, M. A., Hassanien, A. E., and Mostafa, A. (2016). The prediction of virus mutation using neural networks and rough set techniques. EURASIP J. Bioinform. Syst. Biol. 13, 016–0042. doi: 10.1186/s13637-016-0042-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Xu, H., Yang, Y., Wang, S., Zhu, R., Qiu, T., Qiu, J., et al. (2016). Predicting the mutating distribution at antigenic sites of the influenza virus. Sci. Rep. 6:20239.

Google Scholar

Yin, R., Luusua, E., Dabrowski, J., Zhang, Y., and Kwoh, C. K. (2020). Tempel: time-series mutation prediction of influenza A viruses via attention-based recurrent neural networks. Bioinformatics 36, 2697–2704.

Google Scholar

Keywords: influenza, hemagglutinin (HA) protein, antigenic region, mutation, mutation profile

Citation: Mao T, Yan D, Zhou M, Qiu T and Cao Z (2022) Possibility of estimating future mutants for influenza: Comparison between previous prediction and subsequent years observation. Front. Microbiol. 13:1031672. doi: 10.3389/fmicb.2022.1031672

Received: 30 August 2022; Accepted: 16 September 2022;
Published: 05 October 2022.

Edited by:

Feng Gao, Tianjin University, China

Reviewed by:

Veljko Veljkovic, University of Belgrade, Serbia

Copyright © 2022 Mao, Yan, Zhou, Qiu and Cao. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tianyi Qiu, ty_qiu@126.com; Zhiwei Cao, zwcao@fudan.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.