This paper is published in Volume-7, Issue-3, 2021
Area
Computer Engineering
Author
Hardik Shah, Dharmik Timbadia
Org/Univ
Rajiv Gandhi Institute of Technology, Mumbai, Maharashtra, India
Pub. Date
22 June, 2021
Paper ID
V7I3-2006
Publisher
Keywords
Scam, Phishing, Random Forest, Naïve Bayes, XG Boost, Data loss

Citationsacebook

IEEE
Hardik Shah, Dharmik Timbadia. Phishing Websites Classification Based on Machine Learning, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Hardik Shah, Dharmik Timbadia (2021). Phishing Websites Classification Based on Machine Learning. International Journal of Advance Research, Ideas and Innovations in Technology, 7(3) www.IJARIIT.com.

MLA
Hardik Shah, Dharmik Timbadia. "Phishing Websites Classification Based on Machine Learning." International Journal of Advance Research, Ideas and Innovations in Technology 7.3 (2021). www.IJARIIT.com.

Abstract

Phishing has been a huge problem since the evolution of the Internet. It is kind of an Internet scam. Therefore, no antivirus or any kind of technical protection can completely eradicate this. But there has been research going on and conducted from intellects across the world to fight this online scam. Researches are coming up with different approaches to deal with this problem. The two main focused approaches taken by researchers to tackle phishing are Black Listing and Machine Learning. Machine Learning is a new and innovative way to tackle phishing. For this thesis I went with Machine Learning and heuristic based approach to tackle phishing. This thesis consist of a comparative study of different Machine Learning algorithm like Logistic Regression and ensemble algorithms like Adaboost and Gradientboost in order to know if ensemble algorithms can do a better prediction rather than standard machine Learning algorithms. The result obtained by ensemble algorithms were good but not as promising as expected.