Abstract
More and more Internet users are engaging in health-related online activities such as online search for health information or sharing experience in online communities. Geo-tagged and time-stamped data generated from these online activities could inform public health practitioners of the population’s information needs, opinions, attitudes, and behaviors on specific health issues, and health status, becoming an important source of public health surveillance. The heterogeneity and multi-facet nature of social media and search data requires an ecological view to guide big data analysis, interpretation, and visualization. In our work, we recognize the complexity and dynamics of big data, and the efforts of previous scholars in collecting, cleaning/filtering, analyzing, and modeling big data—especially, Google search and Twitter data for public health surveillance. Through eight propositions, we construct an ecological model of Google Search and Twitter data for public health surveillance by differentiating message sources, goals, content, and geo- and time-stamped information. This model monitors public attention of health issues by analyzing information flow from celebrities, media, organizations, blogs, to consumers, who react, by retweeting or searching online. In addition, it detects diseases by examining consumers’ reports of personal experiences on Twitter and online search.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Achrekar, H., Gandhe, A., Lazarus, R., Ssu-Hsin, Y., & Liu, B. (2011). Predicting flu trends using Twitter data. In 2011 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) (pp. 702–707).
Allem, J.-P., Ferrara, E., Uppu, S. P., Cruz, T. B., & Unger, J. B. (2017). E-Cigarette surveillance with social media data: social bots, emerging topics, and trends. JMIR Public Health Surveillance, 3, e98.
Aramaki, E., Maskawa, S., & Morita, M. (2011). Twitter catches the flu: Detecting influenza epidemics using Twitter. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing (pp 1568–1576). Edinburgh, Scotland, UK: Association for Computational Linguistics.
Askitas, N., & Zimmermann, K. F. (2009). Google econometrics and unemployment forecasting. https://doi.org/10.2139/ssrn.1465341
Battelle, J. (2005). The birth of Google. Wired-San Francisco, 13, 102.
Bertino, E. (2013). Big data—Opportunities and challenges panel position paper. In 2013 IEEE 37th Annual Computer Software Applications Conference (pp. 479–480). IEEE.
Bosley, J. C., Zhao, N. W., Hill, S., Shofer, F. S., Asch, D. A., Becker, L. B., & Merchant, R. M. (2013). Decoding twitter: Surveillance and trends for cardiac arrest and resuscitation communication. Resuscitation, 84, 206–212.
Broniatowski, D. A., Paul, M. J., & Dredze, M. (2013). National and local influenza surveillance through Twitter: An analysis of the 2012–2013 influenza epidemic. PLoS One, 8, e83672.
Brownstein, J. S., Freifeld, C. C., & Madoff, L. C. (2009). Digital disease detection—Harnessing the web for public health surveillance. New England Journal of Medicine, 360, 2153–2157.
Butler, D. (2013). When Google got flu wrong: US outbreak foxes a leading web-based method for tracking seasonal flu. Nature, 494, 155–157.
Carneiro, H. A., & Mylonakis, E. (2009). Google Trends: A web-based tool for real-time surveillance of disease outbreaks. Clinical Infectious Diseases, 49, 1557–1564.
Cha, M., Benevenuto, F., Haddadi, H., & Gummadi, K. (2012). The world of connections and information flow in Twitter. IEEE Transactions on Systems, Man, and Cybernetics—Part A Systems and Humans, 42, 991–998.
Chan, E. H., Sahai, V., Conrad, C., & Brownstein, J. S. (2011). Using web search query data to monitor dengue epidemics: A new model for neglected tropical disease surveillance. PLoS Neglected Tropical Diseases, 5, e1206.
Chen, F., Griffith, A., Cottrell, A., & Wong, Y.-L. (2013). Behavioral responses to epidemics in an online experiment: using virtual diseases to study human behavior. PLoS One, 8, e52814.
Chevalier, J. A., & Mayzlin, D. (2006). The effect of word of mouth on sales: Online book reviews. Journal of Marketing Research, 43, 345–354.
Chew, C., & Eysenbach, G. (2010). Pandemics in the age of Twitter: Content analysis of tweets during the 2009 H1N1 outbreak. PLoS One, 5, e14118.
Chunara, R., Andrews, J. R., & Brownstein, J. S. (2012). Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak. American Journal of Tropical Medicine and Hygiene, 86, 39–45.
De Choudhury, M., Morris, M. R., & White, R. W. (2014). Seeking and sharing health information online: Comparing search engines and social media. In Proceedings of SIGCHI Conference on Human Factors Computing Systems (pp. 1365–1376). New York, NY, USA: Association for Computing Machinery.
De Vogli, R., Kouvonen, A., & Gimeno, D. (2014). The influence of market deregulation on fast food consumption and body mass index: A cross-national time series analysis. Bulletin of the World Health Organization, 92, 99-107A.
Del Vicario, M., Bessi, A., Zollo, F., Petroni, F., Scala, A., Caldarelli, G., Stanley, H. E., & Quattrociocchi, W. (2016). The spreading of misinformation online. Proceedings of the National Academy of Sciences USA, 113, 554–559.
Doan, S., Ohno-Machado, L., & Collier, N. (2012). Enhancing Twitter data analysis with simple semantic filtering: Example in tracking influenza-like illnesses. In 2012 IEEE Second International Conference on Healthcare Informatics, Imaging and System Biology (pp. 62–71).
Dugas, A. F., Jalalpour, M., Gel, Y., Levin, S., Torcaso, F., Igusa, T., & Rothman, R. E. (2013). Influenza forecasting with Google Flu Trends. PLoS One, 8, e56176.
Duggan, M. (2015). The demographics of social media users. Pew Research Center’s Internet Science & Technology.
Eichstaedt, J. C., Schwartz, H. A., Kern, M. L., et al. (2015). Psychological language on Twitter predicts county-level heart disease mortality. Psychological Science, 26, 159–169.
eMarketer.com (2018) Search referral share, by search engine, US performance metrics, estimates and historical data. In Inside Intell. Retrieved Sep 1, 2019, from https://www.emarketer.com/performance/channel/59ee1f37bfce890eb411f134/58e39a6f2357af0f1484d953.
Ettredge, M., Gerdes, J., & Karuga, G. (2005). Using web-based search data to predict macroeconomic statistics. Communications of the ACM, 48, 87–92.
Eysenbach, G. (2009). Infodemiology and infoveillance: Framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the internet. Journal of Medical Internet Research, 11, e11.
Fox, S., & Duggan, M. (2013). Health online 2013. Health (n y), 2013, 1–55.
Ginsberg, J., Mohebbi, M. H., Patel, R. S., Brammer, L., Smolinski, M. S., & Brilliant, L. (2009). Detecting influenza epidemics using search engine query data. Nature, 457, 1012–1014.
Gittelman, S., Lange, V., Crawford, C. A. G., Okoro, C. A., Lieb, E., Dhingra, S. S., & Trimarchi, E. (2015). A new source of data for public health surveillance: Facebook likes. Journal of Medical Internet Research, 17, e98.
Goldstein, S., MacDonald, N. E., & Guirguis, S. (2015). Health communication and vaccine hesitancy. Vaccine, 33, 4212–4214.
Granka, L. A., Joachims, T., & Gay, G. (2004). Eye-tracking analysis of user behavior in WWW search. In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 478–479). New York, NY, USA: Association for Computing Machinery.
Guan, M., & So, J. (2016). Influence of social identity on self-efficacy beliefs through perceived social support: A social identity theory perspective. Communication Studies, 67, 588–604.
Hay, S. I., George, D. B., Moyes, C. L., & Brownstein, J. S. (2013). Big data opportunities for global infectious disease surveillance. PLoS Medicine, 10, e1001413.
Hulth, A., Rydevik, G., & Linde, A. (2009). Web queries as a source for syndromic surveillance. PLoS One, 4, e4378.
Jansen, B. J., Zhang, M., Sobel, K., & Chowdury, A. (2009). Twitter power: Tweets as electronic word of mouth. Journal of the American Society of Information Science and Technology, 60, 2169–2188.
Janssen, H. G., Davies, I. G., Richardson, L. D., & Stevenson, L. (2018). Determinants of takeaway and fast food consumption: A narrative review. Nutrition Research Reviews, 31, 16–34.
Kwak, H., Lee, C., Park, H., & Moon, S. (2010). What is Twitter, a social network or a news media? In Proceedings of the 19th International Conference on World Wide Web (pp. 591–600). New York, NY, USA: Association for Computing Machinery.
Lamb, A., Paul, M. J., & Dredze, M. (2012). Investigating Twitter as a source for studying behavioral responses to epidemics. In AAAI Fall Symposium on Information Retrieval and Knowledge Discovery in Biomedical Text (pp. 81–83). Citeseer.
Law, M. R., Mintzes, B., & Morgan, S. G. (2011). The sources and popularity of online drug information: An analysis of top search engine results and web page views. Annals of Pharmacotherapy, 45, 350–356.
Lazer, D., Kennedy, R., King, G., & Vespignani, A. (2014). The parable of Google flu: Traps in big data analysis. Science, 343, 1203–1205.
Liang, B., & Scammon, D. L. (2016). Food contamination incidents: What do consumers seek online? Who cares? International Journal of Nonprofit and Voluntary Sector Marketing, 21, 227–241.
Liang, B., Wang, Y., & Tsou, M.-H. (2019). A “fitness” theme may mitigate regional prevalence of overweight and obesity: Evidence from Google search and tweets. Journal of Health Communication, 24, 683–692.
Liang, B., & Scammon, D. L. (2013). Incidence of online health information search: A useful proxy for public health risk perception. Journal of Medical Internet Research, 15, e114.
Marmot, M. (2005). Social determinants of health inequalities. The Lancet, 365, 1099–1104.
Matsaganis, M. D., & Wilkin, H. A. (2015). Communicative social capital and collective efficacy as determinants of access to health-enhancing resources in residential communities. Journal of Health Communication, 20, 377–386.
McMullan, R. D., Berle, D., Arnáez, S., & Starcevic, V. (2019). The relationships between health anxiety, online health information seeking, and cyberchondria: Systematic review and meta-analysis. Journal of Affective Disorders, 245, 270–278.
Metcalfe, D., Price, C., & Powell, J. (2011). Media coverage and public reaction to a celebrity cancer diagnosis. Journal of Public Health, 33, 80–85.
Modave, F., Shokar, N. K., Peñaranda, E., & Nguyen, N. (2014). Analysis of the accuracy of weight loss information search engine results on the internet. American Journal of Public Health, 104, 1971–1978.
Olson, D. R., Konty, K. J., Paladini, M., Viboud, C., & Simonsen, L. (2013). Reassessing Google flu trends data for detection of seasonal and pandemic influenza: A comparative epidemiological study at three geographic scales. PLoS Computational Biology, 9, e1003256.
Oyeyemi, S. O., Gabarron, E., & Wynn, R. (2014). Ebola, Twitter, and misinformation: A dangerous combination? BMJ, 349, g6178.
Pan, B., Hembrooke, H., Joachims, T., Lorigo, L., Gay, G., & Granka, L. (2007). In Google we trust: Users’ decisions on rank, position, and relevance. Journal of Computer-Mediated Communication, 12, 801–823.
Paul, M. J., Dredze, M., & Broniatowski, D. (2014). Twitter improves influenza forecasting. PLoS Currents. https://doi.org/10.1371/currents.outbreaks.90b9ed0f59bae4ccaa683a39865d9117
Paul, M. J., & Dredze, M. (2012). A model for mining public health topics from Twitter. Health (N. Y.), 11.
Pelat, C., Turbelin, C., Bar-Hen, A., Flahault, A., & Valleron, A.-J. (2009). More diseases tracked by using Google Trends. Emerging Infectious Diseases, 15, 1327–1328.
Perrin, A., & Anderson, M. (2019). Share of U.S. adults using social media, including Facebook, is mostly unchanged since 2018. In Pew Research Center. Retrieved April 15, 2021, from https://www.pewresearch.org/fact-tank/2019/04/10/share-of-u-s-adults-using-social-media-including-facebook-is-mostly-unchanged-since-2018/.
Pershad, Y., Hangge, P. T., Albadawi, H., & Oklu, R. (2018). Social medicine: Twitter in healthcare. Journal of Clinical Medicine, 7, 121.
Preece, J., Nonnecke, B., & Andrews, D. (2004). The top five reasons for lurking: Improving community experiences for everyone. Computers in Human Behavior, 20, 201–223.
Reich, J. A. (2020). “We are fierce, independent thinkers and intelligent”: Social capital and stigma management among mothers who refuse vaccines. Social Science & Medicine, 257, 112015.
Reidpath, D. D., Burns, C., Garrard, J., Mahoney, M., & Townsend, M. (2002). An ecological study of the relationship between social and environmental determinants of obesity. Health & Place, 8, 141–145.
Rose, D. E., & Levinson, D. (2004). Understanding user goals in web search. In Proceedings of the 13th International Conference on World Wide Web (pp. 13–19). New York, NY, USA: Association for Computing Machinery.
Saelens, B. E., Sallis, J. F., & Frank, L. D. (2003). Environmental correlates of walking and cycling: Findings from the transportation, urban design, and planning literatures. Annals of Behavioral Medicine, 25, 80–91.
Salathé, M., & Khandelwal, S. (2011). Assessing vaccination sentiments with online social media: Implications for infectious disease dynamics and control. PLoS Computational Biology. https://doi.org/10.1371/journal.pcbi.1002199
Sallis, J. F., Owen, N., & Fisher, E. (2015). Ecological models of health behavior. Health Behavior: Theory, Research, and Practice, 5.
Santillana, M., Nguyen, A. T., Dredze, M., Paul, M. J., Nsoesie, E. O., & Brownstein, J. S. (2015). Combining search, social media, and traditional data sources to improve influenza surveillance. PLoS Computational Biology, 11, e1004513.
Scheitle, C. P. (2011). Google’s Insights for search: A note evaluating the use of search engine data in social research*. Social Science Quarterly, 92, 285–295.
Seifter, A., Schwarzwalder, A., Geis, K., & Aucott, J. (2010). The utility of “Google Trends” for epidemiological research: Lyme disease as an example. Geospatial Health, 4, 135–137.
Silverstein, C., Henzinger, M., Marais, H., & Moricz, M. (1998). Analysis of a very large AltaVista query log. Technical Report 1998-014, Digital SRC.
Southwell, B. G., Dolina, S., Jimenez-Magdaleno, K., Squiers, L. B., & Kelly, B. J. (2016). Zika virus-related news coverage and online behavior, United States, Guatemala, and Brazil. Emerging Infectious Diseases, 22, 1320–1321.
Spink, A., Wolfram, D., Jansen, M. B. J., & Saracevic, T. (2001). Searching the web: The public and their queries. Journal of the American Society of Information Science and Technology, 52, 226–234.
Tsou, M.-H. (2015). Research challenges and opportunities in mapping social media and Big Data. Cartography and Geographic Information Science, 42, 70–74.
Tsou, M.-H., Yang, J.-A., Lusher, D., Han, S., Spitzberg, B., Gawron, J. M., Gupta, D., & An, L. (2013). Mapping social activities and concepts with social media (Twitter) and web search engines (Yahoo and Bing): A case study in 2012 US Presidential Election. Cartography and Geographic Information Science, 40, 337–348.
Velasco, E., Agheneza, T., Denecke, K., Kirchner, G., & Eckmanns, T. (2014). Social media and internet-based data in global systems for public health surveillance: A systematic review. Milbank Quarterly, 92, 7–33.
Wachowicz, M., & Liu, T. (2016). Finding spatial outliers in collective mobility patterns coupled with social ties. International Journal of Geographical Information Science, 30, 1806–1831.
Wang, Y., & Li, Y. (2016). Proactive engagement of opinion leaders and organization advocates on social networking sites. International Journal of Strategic Communications, 10, 115–132.
Wang, Y., & Rodgers, S. (2013). Reporting on health to ethnic populations: A content analysis of local health news in ethnic versus mainstream newspapers. Howard Journal of Communications, 24, 257–274.
Willis, E. (2016). Patients’ self-efficacy within online health communities: Facilitating chronic disease self-management behaviors through peer education. Health Communication, 31, 299–307.
Wilson, K., & Brownstein, J. S. (2009). Early detection of disease outbreaks using the Internet. CMAJ, 180, 829–831.
Wojcik, S., & Hughes, A. (2019). Sizing up Twitter users. Pew Research Center Internet, Science & Technology.
Wu, S., Hofman, J. M., Mason, W. A., & Watts, D. J. (2011). Who says what to whom on twitter. In Proceedings of the 20th International Conference on World Wide Web (pp. 705–714). New York, NY, USA: Association for Computing Machinery.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Liang, B., Wang, Y. (2021). Conceptualizing an Ecological Model of Google Search and Twitter Data in Public Health. In: Nara, A., Tsou, MH. (eds) Empowering Human Dynamics Research with Social Media and Geospatial Data Analytics. Human Dynamics in Smart Cities. Springer, Cham. https://doi.org/10.1007/978-3-030-83010-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-030-83010-6_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-83009-0
Online ISBN: 978-3-030-83010-6
eBook Packages: Social SciencesSocial Sciences (R0)