Leader Behaviour Prediction

This project will deal with extracting and gathering information about the behaviour/ bad work (corresponding to predefined adjectives ) of a leader/ representative by constantly scraping news website.

We have converted the output to a JSON file.

Installing required libraries

sudo pip install requirements.txt

Scraping Times of India website

I have scraped Times Of India Website specially for this purpose.

The dataset got after scraping Times of India website

This dataset have the details of the scrapped article. We have to scrap the text and get the names. Then we have to match the details of the adjective with the matched names that is got.

The dataset is present in the path :

LeaderBehaviour/leaderBehaviour/leaderBehaviour/spiders/newsTOI.sqlite

Scraped names of the members of parliaments in US :

LeaderBehaviour/getUSNames/getUSNames/spiders/getUSNames.json

Scraped the names of the members of parliaments in India :

LeaderBehaviour/getIndianPolNames/getIndianPolNames/spiders/getIndianPolNames.json

Additional Objectives :

* used headers/ user-agent in scrapy.
* need to use proxy/ integrate with Tor to make it completely untraceable.

Possible name extraction from the extracted text :

LeaderBehaviour/leaderBehaviour/leaderBehaviour/spiders/extractNamesTOI.py
LeaderBehaviour/leaderBehaviour/leaderBehaviour/spiders/probable_names_extracted.json

Note

Go to the directory real_shit, then copy the scrapTOI.sqlite, then run *** python get_neg.py***.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Leader Behaviour Prediction

Installing required libraries

Scraping Times of India website

The dataset got after scraping Times of India website

Scraped names of the members of parliaments in US :

Scraped the names of the members of parliaments in India :

Additional Objectives :

Possible name extraction from the extracted text :

Note

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
getIndianPolNames		getIndianPolNames
getUSNames		getUSNames
leaderBehaviour		leaderBehaviour
real_shit		real_shit
LICENSE		LICENSE
README.md		README.md
output_data.json		output_data.json
requirements.txt		requirements.txt

License

Jimut123/LeaderBehaviour

Folders and files

Latest commit

History

Repository files navigation

Leader Behaviour Prediction

Installing required libraries

Scraping Times of India website

The dataset got after scraping Times of India website

Scraped names of the members of parliaments in US :

Scraped the names of the members of parliaments in India :

Additional Objectives :

Possible name extraction from the extracted text :

Note

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages