Published July 22, 2021 | Version v1
Conference paper Open

Caracterização do debate no Twitter sobre a vacinação contra a COVID-19 no Brasil

Description

Our data collection was driven by the goal of gathering a corpus of Portuguese-language tweets that would be informative of the online debate on COVID-19 vaccines. To that end, we used the Twitter API Search  to collect tweets based on specific keywords related to COVID-19 vaccination. We built a list of such keywords that include  vaccine and health care related terms, as well as words related to the most well known COVID-19 vaccines available so far. Specifically, we consider the following list of keywords: vacinação, vacina, vachina, sputnikv, pfizer, novavax, moderna, coronavac, covaxin, biontech, astrazeneca, bnt162b2.  In total, we gathered over 9 million tweets, covering 9 weeks, from December 1st, 2020 to January 31st, 2021. This is an important period that includes the launch of the first worldwide COVID-19 vaccination campaign (launched on December 8th in the United Kingdom), as well as several other important real-world events that influenced and dictated people's discussions.

 

This dataset is aggregated by weeks and keywords. Only the tweets IDs are available following Twitter's Privacy Policy. 

Files

tweet_ids_dataset_pt.zip

Files (67.5 MB)

Name Size Download all
md5:8f0998b36e59c067552e2e76c282ce75
67.5 MB Preview Download