COVID-19 Data Exploration
SQL | Excel | Tableau
Background
The aim of this project was to explore the COVID-19 pandemic and understand its impact on public health globally between 2020 and 2021, using SQL, Excel and Tableau. The data was obtained from Our World in Data.
This is a follow-up to an earlier COVID-19 project in which I used SQL to do a deep dive into COVID cases, hospitalization rates, infection rates, death rates, population, and vaccination rates across different countries and continents. You can view the SQL analysis I did here.
Focus Questions
★ What is the total number of COVID cases globally from the start of the pandemic to date?
★ What percentage of each country's total population has been infected with COVID?
★ What is the total number of people who have died as a result of COVID globally?
★ What percentage of the people who got COVID in each country died?
Data Analysis Process
SQL
I imported the dataset into a COVID database I created in Microsoft SQL Server, using SQL Server Management Studio.
I calculated the total cases, total deaths and death percentage using aggregate functions in a query.
I used the CAST function in the query to change the ‘new_deaths’ column from an NVARCHAR data type to an INT data type so that the calculation could work.
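A minimal sketch of this kind of query is shown here. It is not the original query: it runs against an in-memory SQLite database through Python's sqlite3 module as a stand-in for SQL Server, the table name covid_deaths and the new_cases column are assumptions, and the rows are made-up illustration data. SQLite spells the target type INTEGER where SQL Server uses INT.

```python
import sqlite3

# Hypothetical miniature of the deaths table; the rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE covid_deaths (location TEXT, new_cases INTEGER, new_deaths TEXT)"
)
conn.executemany(
    "INSERT INTO covid_deaths VALUES (?, ?, ?)",
    [("A", 100, "2"), ("A", 300, "6"), ("B", 600, "12")],
)

# new_deaths is stored as text, so it is cast to an integer before summing.
query = """
SELECT SUM(new_cases) AS total_cases,
       SUM(CAST(new_deaths AS INTEGER)) AS total_deaths,
       SUM(CAST(new_deaths AS INTEGER)) * 100.0 / SUM(new_cases) AS death_percentage
FROM covid_deaths
"""
total_cases, total_deaths, death_percentage = conn.execute(query).fetchone()
print(total_cases, total_deaths, death_percentage)
```

Multiplying by 100.0 rather than 100 forces floating-point division, which avoids the percentage being truncated to a whole number.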
The result:
I calculated the total deaths in each continent. I removed 'World', 'European Union' and 'International' from the final result because they are not continents and are not relevant to this analysis.
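The filtering step can be sketched as follows, again using SQLite through Python as a stand-in for SQL Server. The sketch assumes that continent-level rows carry the continent name in a location column and leave a continent column empty; that layout, the table name covid_deaths and the sample rows are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE covid_deaths (continent TEXT, location TEXT, new_deaths TEXT)"
)
# Made-up rows: aggregate rows have no continent, country rows do.
conn.executemany(
    "INSERT INTO covid_deaths VALUES (?, ?, ?)",
    [
        (None, "Europe", "5"),
        (None, "Europe", "7"),
        (None, "Africa", "3"),
        (None, "World", "15"),
        (None, "European Union", "4"),
        (None, "International", "1"),
        ("Europe", "France", "2"),
    ],
)

# Keep only aggregate rows, then drop the three groupings that are not continents.
query = """
SELECT location, SUM(CAST(new_deaths AS INTEGER)) AS total_deaths
FROM covid_deaths
WHERE continent IS NULL
  AND location NOT IN ('World', 'European Union', 'International')
GROUP BY location
ORDER BY total_deaths DESC
"""
rows = conn.execute(query).fetchall()
for location, total_deaths in rows:
    print(location, total_deaths)
```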
The result:
I proceeded to determine the total number of infected people and the percentage of infected people in the population of every country.
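One way to express this calculation is sketched here with SQLite through Python as a stand-in for SQL Server. It assumes a cumulative total_cases column and a population column, so the highest total_cases value per country is taken as the number of infected people; the table name, column names and rows are illustrative assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE covid_deaths "
    "(location TEXT, date TEXT, population INTEGER, total_cases INTEGER)"
)
# Made-up rows: total_cases is cumulative, so it grows over time.
conn.executemany(
    "INSERT INTO covid_deaths VALUES (?, ?, ?, ?)",
    [
        ("A", "2021-01-01", 1000, 10),
        ("A", "2021-01-02", 1000, 50),
        ("B", "2021-01-01", 2000, 20),
        ("B", "2021-01-02", 2000, 40),
    ],
)

# One row per country: peak cumulative cases and its share of the population.
query = """
SELECT location,
       population,
       MAX(total_cases) AS highest_infection_count,
       MAX(total_cases) * 100.0 / population AS percent_population_infected
FROM covid_deaths
GROUP BY location, population
ORDER BY percent_population_infected DESC
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```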
The result:
Lastly, I tracked the daily changes in the number of infected people and in the percentage of the population infected in every country.
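A daily view of the same figures might look like the sketch below, which returns one row per country per date instead of one row per country. As before, SQLite through Python stands in for SQL Server, and the table name, column names and rows are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE covid_deaths "
    "(location TEXT, date TEXT, population INTEGER, total_cases INTEGER)"
)
# Made-up rows; dates are stored as ISO text so they sort chronologically.
conn.executemany(
    "INSERT INTO covid_deaths VALUES (?, ?, ?, ?)",
    [
        ("A", "2021-01-02", 1000, 50),
        ("A", "2021-01-01", 1000, 10),
        ("B", "2021-01-01", 2000, 20),
    ],
)

# Keeping date in the result gives a time series that can be charted per country.
query = """
SELECT location,
       date,
       total_cases,
       total_cases * 100.0 / population AS percent_population_infected
FROM covid_deaths
ORDER BY location, date
"""
daily_rows = conn.execute(query).fetchall()
for row in daily_rows:
    print(row)
```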
The result:
Excel
Microsoft SQL Server couldn’t be integrated with Tableau Public, so I copied all the result tables from SQL Server into an Excel file and saved them there.
Additionally, I replaced NULL values with the number 0, because every value in the relevant columns needs to be numeric to avoid errors in the final analysis and visualization. In the final sheet, I also converted the date column from an unclear format to a proper date format that is readable and usable for analysis.
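The cleanup was done by hand in Excel, but the same two steps can be sketched in code. The example below is only an illustration: the clean_row helper is hypothetical, the column names are assumptions, and the input date format (month/day/year) is a guess, since the original format is not stated.

```python
from datetime import datetime


def clean_row(row, date_format="%m/%d/%Y"):
    """Replace missing values with 0 and rewrite the date as YYYY-MM-DD.

    date_format is an assumed input format, not the one in the original sheet.
    """
    cleaned = {}
    for key, value in row.items():
        if key == "date":
            cleaned[key] = datetime.strptime(value, date_format).date().isoformat()
        elif value in (None, "", "NULL"):
            cleaned[key] = 0
        else:
            cleaned[key] = value
    return cleaned


# Made-up rows standing in for a table copied out of SQL Server.
raw_rows = [
    {"location": "A", "date": "1/23/2020", "total_cases": "NULL"},
    {"location": "A", "date": "1/24/2020", "total_cases": 5},
]
cleaned_rows = [clean_row(row) for row in raw_rows]
for row in cleaned_rows:
    print(row)
```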
Tableau
I uploaded the Excel file into the Tableau Public app to complete the analysis and visualization process.
I created four Tableau worksheets to analyze the data and answer the focus questions, then combined all four sheets into one dashboard.