Caracterização do debate no Twitter sobre a vacinação contra a COVID-19 no Brasil
Creators
- 1. Universidade Federal de Minas Gerais
- 2. IBM Research
Description
Our data collection was driven by the goal of gathering a corpus of Portuguese-language tweets that would be informative of the online debate on COVID-19 vaccines. To that end, we used the Twitter API Search to collect tweets based on specific keywords related to COVID-19 vaccination. We built a list of such keywords that include vaccine and health care related terms, as well as words related to the most well known COVID-19 vaccines available so far. Specifically, we consider the following list of keywords: vacinação, vacina, vachina, sputnikv, pfizer, novavax, moderna, coronavac, covaxin, biontech, astrazeneca, bnt162b2. In total, we gathered over 9 million tweets, covering 9 weeks, from December 1st, 2020 to January 31st, 2021. This is an important period that includes the launch of the first worldwide COVID-19 vaccination campaign (launched on December 8th in the United Kingdom), as well as several other important real-world events that influenced and dictated people's discussions.
This dataset is aggregated by weeks and keywords. Only the tweets IDs are available following Twitter's Privacy Policy.
Files
tweet_ids_dataset_pt.zip
Files
(67.5 MB)
Name | Size | Download all |
---|---|---|
md5:8f0998b36e59c067552e2e76c282ce75
|
67.5 MB | Preview Download |