Application of Database Systems and Analytics to Covid19 Disease

The Novel Coronavirus (Covid19), a virus that has effected approximately 3 Million people across the world has been analysed in this project.

The analysis was performed by gathering related datasets of four (4) different formats: .xml, .json, .csv and web scrapped data.
Pre-processing: web-scrapping data off the two websites, extracted XML data, stored these data into mongoDB, cleaned and saved the processed data into PostgreSQL
- Wesbsite 1: https://www.worldometers.info/coronavirus/
- Wesbsite 2: https://data.medicare.gov/widgets/xubh-q36u
- CSV Dataset: https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset
- XML dataset: https://opendata.ecdc.europa.eu/covid19/casedistribution/xml/
- JSON dataset: https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
After fetching, pre-processing and combining all these datasets, visualization on the Covid19 data was carried out and a forecast on new cases and deaths was predicted using Auto-ARIMA model.
The visualizationsprovides understanding of the patterns of the impact of Covid-19 outbreak over different Countries, number of deaths, number of people recovered, and regions highly effected.

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
CSV		CSV
CleanedData_TobiEk		CleanedData_TobiEk
DB Code		DB Code
JSON		JSON
Visualization		Visualization
Web_scrapping		Web_scrapping
XML		XML
Application of Database Systems and Analytics to Covid19 Disease.pdf		Application of Database Systems and Analytics to Covid19 Disease.pdf
CSV_clean_process.py		CSV_clean_process.py
Covid19_Tweet_json_fetch_api_store_in_postgres.py		Covid19_Tweet_json_fetch_api_store_in_postgres.py
Covid19_data_postgres_relation_query_on_tables.py		Covid19_data_postgres_relation_query_on_tables.py
Project_Webscrapping_v1.py		Project_Webscrapping_v1.py
Project_XML-to-db.py		Project_XML-to-db.py
README.md		README.md

Provide feedback