Grant APA Citation: Stubbs-Richardson, M., Anreddy, S., & Porter, B. (2020). RAPID: Analyses of emotions expressed in social media and forums during the COVID-19 pandemic (Award No. 2031246). National Science Foundation.

Website APA Citation: Stubbs-Richardson, M., Anreddy, S., & Porter, B. (2022, June 29). COVID-19 online prevalence of emotions in institutions database. COPE-ID. Copeid.ssrc.msstate.edu.

Database APA Citation: Stubbs-Richardson, M., Anreddy, S., & Porter, B. (2022, June 29). COVID-19 online prevalence of emotions in institutions database. Data Science for the Social Sciences Laboratory in the Social Science Research Center at Mississippi State University.

COVID-19 ONLINE PREVALENCE OF EMOTIONS IN INSTITUTIONS DATABASE

The COVID-19 Online Prevalence of Emotions in Institutions Database (COPE-ID) contains data mined from social media and forum posts about the COVID-19 pandemic from January 2020 to April 2021 using the following keywords: covid-19, sars-cov-2, corona, coronavirus, coronaviruses, social distancing, quarantine, covid19, pandemic, virus, and #socialdistancing. Each data record is a post or comment about COVID-19 and includes associated metadata that varies by platform. Supporting CSV files are provided for each platform and contain the related metadata for each post.
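As a rough illustration of how the keyword list above could be applied to a post, the following Python sketch flags text that contains any of the listed terms. The case-insensitive substring matching and the function names `matched_keywords` and `is_covid_related` are assumptions made for this example; the description above does not specify how each platform's API matched the keywords.

```python
# Keywords as listed in the database description.
KEYWORDS = [
    "covid-19", "sars-cov-2", "corona", "coronavirus", "coronaviruses",
    "social distancing", "quarantine", "covid19", "pandemic", "virus",
    "#socialdistancing",
]


def matched_keywords(text):
    """Return the sorted list of keywords found in `text`.

    Assumption: matching is a case-insensitive substring test, so a post
    containing "coronavirus" also matches "corona" and "virus".
    """
    lowered = text.lower()
    return sorted(k for k in KEYWORDS if k in lowered)


def is_covid_related(text):
    """Return True if `text` contains at least one keyword."""
    return bool(matched_keywords(text))


if __name__ == "__main__":
    for post in ["Stuck in Quarantine again", "Nice weather today"]:
        print(post, "->", matched_keywords(post))
```

Because shorter keywords are substrings of longer ones, a single post can match several terms at once; a stricter variant might match on word boundaries instead.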

Below is information about the 10 platforms from which COVID-19-related data was mined using Application Programming Interfaces (APIs), along with the date range covered, the number of unique posts collected from each site, and per-platform documentation that explains the data files.

4chan

Raw Data Collected: 5.9 MB
Processed Data: 4.4 MB
Unofficial API: 4chan/4chan-API
Total Posts: 15,039
Earliest Date: 07/17/2020
Latest Date: 04/27/2021

8kun

Raw Data Collected: 2.2 MB
Processed Data: 1.1 MB
Unofficial API: bibanon/py8chan
Total Posts: 709
Earliest Date: 02/05/2020
Latest Date: 04/20/2021

Flickr

Raw Data Collected: 661 KB
Processed Data: 71 KB
Official API: Flickr API
Total Photo Tags: 7,027
Earliest Date: 02/24/2021
Latest Date: 04/20/2021

Gab

Raw Data Collected: 204.8 MB
Processed Data: 149.4 MB
Unofficial API: ChrisStevens/garc 
Total Posts: 171,288
Earliest Date: 01/01/2020
Latest Date: 02/24/2021

Mastodon

Raw Data Collected: 37.8 MB
Processed Data: 27.6 MB
Official API: Mastodon API
Total Posts: 77,054
Earliest Date: 01/08/2020
Latest Date: 04/20/2021

Parler

Raw Data Collected: 154.7 MB
Processed Data: 54.8 MB
Unofficial API: KonradIT/parler-py-api
Total Posts: 80,742
Earliest Date: 04/05/2020
Latest Date: 01/04/2021

Reddit

Raw Data Collected: 8.75 GB
Processed Data: 6.91 GB
Unofficial API: pushshift/api
Total Posts: 11,209,463
Earliest Date: 01/01/2020
Latest Date: 03/31/2021

Tumblr

Raw Data Collected: 30.7 MB
Processed Data: 12.2 MB
Official API: Tumblr API
Total Posts: 82,690
Earliest Date: 01/01/2020
Latest Date: 03/31/2021

Twitter

Raw Data Collected: 779 MB
Processed Data: 1.03 GB
Official API: Twitter API v2
Total Posts: 3,054,857
Earliest Date: 10/17/2020
Latest Date: 04/27/2021

YouTube

Raw Data Collected: 64.1 MB
Processed Data: 50.9 MB
Official API: YouTube Data API v3
Total Posts: 271,550
Earliest Date: 03/01/2020
Latest Date: 04/22/2021
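
The per-platform figures listed above can be combined to get a sense of the overall collection. The Python sketch below is only an illustration: the `PLATFORMS` dictionary is transcribed by hand from this page, the helper names are hypothetical, and the dates are assumed to be in MM/DD/YYYY format. Note that the Flickr count refers to photo tags rather than posts.

```python
from datetime import datetime

# (record count, earliest date, latest date), transcribed from the listing above.
PLATFORMS = {
    "4chan": (15039, "07/17/2020", "04/27/2021"),
    "8kun": (709, "02/05/2020", "04/20/2021"),
    "Flickr": (7027, "02/24/2021", "04/20/2021"),  # photo tags, not posts
    "Gab": (171288, "01/01/2020", "02/24/2021"),
    "Mastodon": (77054, "01/08/2020", "04/20/2021"),
    "Parler": (80742, "04/05/2020", "01/04/2021"),
    "Reddit": (11209463, "01/01/2020", "03/31/2021"),
    "Tumblr": (82690, "01/01/2020", "03/31/2021"),
    "Twitter": (3054857, "10/17/2020", "04/27/2021"),
    "YouTube": (271550, "03/01/2020", "04/22/2021"),
}

DATE_FORMAT = "%m/%d/%Y"  # assumption: dates are written MM/DD/YYYY


def total_records(platforms):
    """Sum the record counts across all platforms."""
    return sum(count for count, _, _ in platforms.values())


def collection_days(earliest, latest):
    """Number of days between the earliest and latest record dates."""
    start = datetime.strptime(earliest, DATE_FORMAT)
    end = datetime.strptime(latest, DATE_FORMAT)
    return (end - start).days


if __name__ == "__main__":
    print("Total records:", total_records(PLATFORMS))
    for name, (count, earliest, latest) in PLATFORMS.items():
        print(f"{name}: {count:,} records over {collection_days(earliest, latest)} days")
```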

Questions about the data collection process, accessing, downloading, or analyzing the database files?

Email Our Team!