Skip to content

southern-cross-ai/twitter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

62 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview of the Dataset

This repository contains various datasets focused on tweets with locations in Australia.

1. Tweets on Australian Election

2. IEEE

3. Australian Cities Tweets

  • Time Span/Geopolitical Info: Focuses on tweets from various cities across Australia
  • Data Source/Credit: Dataset provided by Kaggle, available https://www.kaggle.com/datasets/wjia26/australian-cities-tweets
  • Cleaned Status: The dataset is cleaned.
  • License: Please refer to the Kaggle page for specific licensing details.

4. Australian Cricket Tweets

  • Time Span/Geopolitical Info: Contains tweets related to cricket
  • Data Source/Credit: Dataset provided by Kaggle, available https://www.kaggle.com/datasets/gpreda/cricket-tweets
  • Cleaned Status: The dataset is cleaned.
  • License: Please refer to the Kaggle page for specific licensing details.

5. Tweets Using the Hashtag #australianvalues (22-27 April 2017)

6. Lpheada: Labelled Public Health Dataset

7. TBCOV: Two Billion Multilingual COVID-19 Tweets with Sentiment, Entity, Geo, and Gender Labels

  • Time Span/Geopolitical Info: This dataset contains tweets related to the COVID-19 pandemic over a 14-month period from February 1st, 2020 till March 31st, 2021.
  • Data Source/Credit: Available on https://crisisnlp.qcri.org/tbcov, https://github.com/CrisisComputing/TBCOV
  • Cleaned Status: The dataset is not cleaned. Tweets need to be hydrated and cleaned.
  • License: Please refer to the GitHub repository for licensing details.

8. (🌇Sunset) 🇺🇦 Ukraine Conflict Twitter Dataset

9. Tweets on ChatGPT - #ChatGPT

Access to the Dataset

To access these datasets:

  1. For Kaggle Datasets:
  • Visit the provided Kaggle URLs.
  • Log in to your Kaggle account (or create one if you don't have it).
  • Click on the "Download" button on the dataset page.
  1. For IEEE DataPort Dataset:
  • Go to the IEEE DataPort URL.
  • Sign in or register to access the dataset.
  • Follow the instructions to download the data.
  1. For Figshare Dataset:
  • Access the dataset through the Figshare link.
  • Download the dataset directly by clicking the file link.
  1. For GitHub Repository:
  • Navigate to the GitHub repository using the provided link.
  • Clone the repository using the git clone command or download the dataset files directly.

License of the Repo

This repository is licensed under the MIT License. For more details on the licensing of individual datasets, please refer to the specific dataset source pages mentioned above.