nlp-predict-tweets-about-real-disasters

Problem Statement:

Twitter has become an important communication channel in times of emergency. The ubiquitousness of smartphones enables people to announce an emergency they’re observing in real-time. Because of this, more agencies are interested in programatically monitoring Twitter (i.e. disaster relief organizations and news agencies). But, it’s not always clear whether a person’s words are actually announcing a disaster. In this competition, you’re challenged to build a machine learning model that predicts which Tweets are about real disasters and which one’s aren’t. You’ll have access to a dataset of 10,000 tweets that were hand classified.

Competition details and dataset can be found in kaggle: https://www.kaggle.com/c/nlp-getting-started

Approach Taken:

Performed text pre-processing and replaced constactions (e.g. wouldn't to would not) in dataset.
Used BERT pre-trainted model bert-base-uncased with maxlength 512.
Identified optimal learning rate (3e-5) and fine-tuned using one cycle policy and the optimal learning rate.
Evaluated on Test dataset. Confusion matrix is as below:

Performed prediction on final dataset and submitted. Got Public Score: 0.81428

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
README.md		README.md
nlp-predict-tweets-about-real-disasters.ipynb		nlp-predict-tweets-about-real-disasters.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nlp-predict-tweets-about-real-disasters

Problem Statement:

Approach Taken:

About

Releases

Packages

Languages

anikch/nlp-predict-tweets-about-real-disasters

Folders and files

Latest commit

History

Repository files navigation

nlp-predict-tweets-about-real-disasters

Problem Statement:

Approach Taken:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages