Research data . Dataset . 2021

The complete corpus of #COVID-19 Twitter dataset

Antonakaki, Despoina
Open Access
English
Published: 22 Jun 2021
Publisher: Zenodo
Abstract
The COVID-19 pandemic, which began over a year ago, continues to spread around the globe, and research on COVID-19 continues to grow as well. The online discourse about COVID-19 on social media has grown along with the pandemic. Open Twitter data have been released, offering the research community the opportunity to make new findings and help address this new threat. In this dataset, we release a corpus of Twitter data from March 2020 to the present, updated daily and based on the two most important hashtags regarding COVID-19. The dataset offers the research community the opportunity to explore the social dimensions of the pandemic, including topic analysis, hate speech analysis, and sentiment analysis regarding users' opinions on the pandemic, comments on the public discourse, or the vaccine rollouts. The dataset was collected by retrieving all tweets that contain the hashtags #coronavirus and #COVID19: approximately 208M tweets for #coronavirus and 392M tweets for #COVID19, for a total of about 600M tweets.
Subjects by Vocabulary

EOSC: Twitter Data

Subjects

Twitter, COVID-19 dataset

Funded by
EC| CONCORDIA
Project
CONCORDIA
Cyber security cOmpeteNCe fOr Research anD InnovAtion
  • Funder: European Commission (EC)
  • Project Code: 830927
  • Funding stream: H2020 | RIA
Related to Research communities
COVID-19