URL Based Spam Classification Using Machine Learning

We have tried to implement the paper "Beyond Blacklists: Learning to Detect Malicious Web Sites from Suspicious URLs" by "Justin Ma, Lawrence K. Saul, Stefan Savage, Geoffrey M. Voelker" of Department of Computer Science and Engineering of the University of California, San Diego.

You can download the paper here.

Initially in the paper the models which have been used include - Logistic Regression, SVM and Naive Bayes. We have tried to extend the paper by using models like - Gradient Boosting, Random Forest and Decision Trees.

To run the code you will need Jupyter-Notebook which is available as a package of the Anaconda Python distribution. Once you have it installed, open the initial implementation and final implementation folders and open the codes in the notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Datasets of concern		Datasets of concern
Documents		Documents
Extra Stuff		Extra Stuff
Final Implementation		Final Implementation
Initial Implementation		Initial Implementation
.gitattributes		.gitattributes
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

URL Based Spam Classification Using Machine Learning

About

Releases

Packages

Languages

Swapneel01/URL-Based-Spam-Classification-Using-Machine-Learning

Folders and files

Latest commit

History

Repository files navigation

URL Based Spam Classification Using Machine Learning

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages