Explore the COVID-19 tweets, analyse the geographical distribution and the sentiment in tweets
In this project we are using a tweet dataset has been collected using Twitter API and a Python script. A query for this high-frequency hashtag (#covid19) is run on a daily basis for a certain time period, to collect a larger number of tweets samples. The collection script can be found here: https://github.com/gabrielpreda/covid-19-tweets The tweets have #covid19 hashtag. Collection started on 25/7/2020, with an initial 17k batch and will continue on a daily basis.
This project requires Python 3.6 and the following Python libraries installed:
NumPy Pandas Matplotlib NLTK You will also need to have software installed to run and execute a Jupyter Notebook
If you do not have Python installed yet, it is highly recommended that you install the Anaconda distribution of Python, which already has the above packages and more included. Make sure that you select Python 3.x installer.
Thanks to Kaggle for providing access to the data used in this project.