Abstract
Users prefer to do e-banking and e-shopping now-a-days because of the exponential growth of the internet. Because of this paradigm shift, hackers are finding umpteen ways to steal our personal information and critical details like
details of debit and credit cards, by disguising themselves as reputed websites, just by changing the spelling or making minor modifications to the URL. Identifying whether an URL is benign or malicious is a challenging job, because it makes use of the weakness of the user. While there are several works carried out to detect phishing websites, they only use heuristic methods and list based techniques and therefore couldn’t avoid phishing effectively. In this paper an anti-phishing system was proposed to protect the users. It uses an ensemble model that uses both LSTM and CNN with a massive data set containing nearly 2,00,000 URLs, that is balanced. After analyzing the accuracy of different existing approaches, it has been found that the ensemble model that uses both LSTM and CNN performed better with an accuracy of 96% and the precision is 97% respectively which is far better than the existing solutions.
details of debit and credit cards, by disguising themselves as reputed websites, just by changing the spelling or making minor modifications to the URL. Identifying whether an URL is benign or malicious is a challenging job, because it makes use of the weakness of the user. While there are several works carried out to detect phishing websites, they only use heuristic methods and list based techniques and therefore couldn’t avoid phishing effectively. In this paper an anti-phishing system was proposed to protect the users. It uses an ensemble model that uses both LSTM and CNN with a massive data set containing nearly 2,00,000 URLs, that is balanced. After analyzing the accuracy of different existing approaches, it has been found that the ensemble model that uses both LSTM and CNN performed better with an accuracy of 96% and the precision is 97% respectively which is far better than the existing solutions.
Original language | English |
---|---|
Title of host publication | 2020 IEEE International Conference for Innovation in Technology (INOCON) |
Place of Publication | Bangluru |
Publisher | IEEE |
Number of pages | 5 |
ISBN (Electronic) | 978-1-7281-9744-9 |
ISBN (Print) | 978-1-7281-9745-6 |
DOIs | |
Publication status | Published - 1 Jan 2021 |
Event | 2020 IEEE International Conference for Innovation in Technology - Bangluru, India Duration: 6 Nov 2020 → 8 Nov 2020 http://inoconf.org/ |
Conference
Conference | 2020 IEEE International Conference for Innovation in Technology |
---|---|
Abbreviated title | INOCON 2020 |
Country/Territory | India |
City | Bangluru |
Period | 6/11/20 → 8/11/20 |
Internet address |
Keywords
- Phishing
- CNN
- RNN
- Deep Learning
- Classification
- Security