Grant APA Citation: Stubbs-Richardson, M., Anreddy, S., & Porter, B. (2020). RAPID: Analyses of emotions expressed in social media and forums during the COVID-19 pandemic (Award No. 2031246). National Science Foundation.
Website APA Citation: Stubbs-Richardson, M., Anreddy, S., & Porter, B. (2022, June 29). COVID-19 online prevalence of emotions in institutions database. COPE-ID. Copeid.ssrc.msstate.edu.
Database APA Citation: Stubbs-Richardson, M., Anreddy, S., & Porter, B. (2022, June 29). COVID-19 online prevalence of emotions in institutions database. Data Science for the Social Sciences Laboratory in the Social Science Research Center at Mississippi State University.
COVID-19 ONLINE PREVALENCE OF EMOTIONS IN INSTITUTIONS DATABASE
The content included in the COVID-19 Online Prevalence of Emotions in Institutions Database (COPE-ID) includes data mined from social media and forum-related posts about the COVID-19 pandemic from January 2020 to April 2021 using the following keywords: covid-19, sars-cov-2, corona, coronavirus, coronavirus, coronaviruses, social distancing, quarantine, covid19, pandemic, virus, and #socialdistancing. Each data record is a post or comment about COVID-19 and includes associated meta-data that varies by platform. Supporting CSV files are provided per platform type and have the related meta-data per post.
Below is information about the 10 platforms that COVID-19-related data was mined from using Application Programming Interfaces (APIs), along with details about date ranges, the number of unique posts collected from the sites, and documentation per platform that explains the data files.
4chan
Raw Data Collected: 5.9 MB
Processed Data: 4.4 MB
Unofficial API: 4chan/4chan-APl
Total Posts: 15,039
Earliest Date: 07/17/2020
Latest Date: 04/27/2021
8kun
Raw Data Collected: 2.2 MB
Processed Data: 1.1 MB
Unofficial API: bibanon/py8chan
Total Posts: 709
Earliest Date: 02/05/2020
Latest Date: 04/20/2021
Flickr
Raw Data Collected: 661 KB
Processed Data: 71 KB
Official API: Flickr API
Total Photo Tags: 7,027
Earliest Date: 02/24/2021
Latest Date: 04/20/2021
Gab
Raw Data Collected: 204.8 MB
Processed Data: 149.4 MB
Unofficial API: ChrisStevens/garc
Total Posts: 171,288
Earliest Date: 01/01/2020
Latest Date: 02/24/2021
Mastodon
Raw Data Collected: 37.8 MB
Processed Data: 27.6 MB
Official API: Mastodon API
Total Posts: 77,054
Earliest Date: 01/08/2020
Latest Date: 04/20/2021
Parler
Raw Data Collected: 154.7 MB
Processed Data: 54.8 MB
Unofficial API: KonradlT/parler-py-api
Total Posts: 80,742
Earliest Date: 04/05/2020
Latest Date: 01/04/2021
Raw Data Collected: 8.75 GB
Processed Data: 6.91 GB
Unofficial API: pushshift/api
Total Posts: 11,209,463
Earliest Date: 01/01/2020
Latest Date: 03/31/2021
Tumblr
Raw Data Collected: 30.7 MB
Processed Data: 12.2 MB
Official API: Tumblr API
Total Posts: 82,690
Earliest Date: 01/01/2020
Latest Date: 03/31/2021
Raw Data Collected: 779 MB
Processed Data: 1.03 GB
Official API: Twitter API v2
Total Posts: 3,054,857
Earliest Date: 10/17/2020
Latest Date: 04/27/2021
YouTube
Raw Data Collected: 64.1 MB
Processed Data: 50.9 MB
Unofficial API: YouTube Data API v3
Total Posts: 271,550
Earliest Date: 03/01/2020
Latest Date: 04/22/2021