This project analyzes U.S. COVID-19 time series data from the Johns Hopkins University Center for Systems Science and Engineering (CSSE) to better understand the factors contributing to COVID-19 deaths. It also incorporates vaccination data from the GovEx repository to enhance predictive modeling.
As of March 10, 2023, the Johns Hopkins Coronavirus Resource Center ceased collection and reporting of global COVID-19 data, making this analysis a retrospective look at pandemic trends.
- Explore trends in COVID-19 cases and deaths across U.S. states.
- Build linear regression models to understand the relationship between case counts and mortality.
- Incorporate vaccination data to assess its impact on COVID-19 outcomes.
- Identify bias and limitations in the available data.
-
COVID-19 Time Series (Cases & Deaths)
Source: Johns Hopkins CSSE GitHub -
Vaccination Data
Source: GovEx GitHub
-
Clone the repository (or download the project folder):
git clone https://github.com/redswimmer/covid-data-analysis.git
-
Open RStudio and load the project folder.
-
Install required packages (if not already installed):
install.packages(c( "lubridate", "corrplot", "tidyverse" ))
-
Run the R Markdown file.