AI@ntiPhish — Machine Learning Mechanisms for Cyber-Phishing Attack

Yu-Hung CHEN; Jiann-Liang CHEN

doi:10.1587/transinf.2018NTI0001

IEICE TRANSACTIONS on Information

AI@ntiPhish — Machine Learning Mechanisms for Cyber-Phishing Attack

Yu-Hung CHEN, Jiann-Liang CHEN

Full Text Views

0

Cite this

Summary :

This study proposes a novel machine learning architecture and various learning algorithms to build-in anti-phishing services for avoiding cyber-phishing attack. For the rapid develop of information technology, hackers engage in cyber-phishing attack to steal important personal information, which draws information security concerns. The prevention of phishing website involves in various aspect, for example, user training, public awareness, fraudulent phishing, etc. However, recent phishing research has mainly focused on preventing fraudulent phishing and relied on manual identification that is inefficient for real-time detection systems. In this study, we used methods such as ANOVA, X², and information gain to evaluate features. Then, we filtered out the unrelated features and obtained the top 28 most related features as the features to use for the training and evaluation of traditional machine learning algorithms, such as Support Vector Machine (SVM) with linear or rbf kernels, Logistic Regression (LR), Decision tree, and K-Nearest Neighbor (KNN). This research also evaluated the above algorithms with the ensemble learning concept by combining multiple classifiers, such as Adaboost, bagging, and voting. Finally, the eXtreme Gradient Boosting (XGBoost) model exhibited the best performance of 99.2%, among the algorithms considered in this study.

Publication: IEICE TRANSACTIONS on Information Vol.E102-D No.5 pp.878-887

Publication Date: 2019/05/01

Publicized: 2019/02/18

Online ISSN: 1745-1361

DOI: 10.1587/transinf.2018NTI0001

Type of Manuscript: Special Section INVITED PAPER (Special Section on the Architectures, Protocols, and Applications for the Future Internet)

Category

Authors

Yu-Hung CHEN
National Taiwan University of Science and Technology
Jiann-Liang CHEN
National Taiwan University of Science and Technology

Keyword

anti-phishing, machine learning algorithm, ensemble learning mechanism, cyber attack

Cite this

Copy

Yu-Hung CHEN, Jiann-Liang CHEN, "AI@ntiPhish — Machine Learning Mechanisms for Cyber-Phishing Attack" in IEICE TRANSACTIONS on Information, vol. E102-D, no. 5, pp. 878-887, May 2019, doi: 10.1587/transinf.2018NTI0001.
Abstract: This study proposes a novel machine learning architecture and various learning algorithms to build-in anti-phishing services for avoiding cyber-phishing attack. For the rapid develop of information technology, hackers engage in cyber-phishing attack to steal important personal information, which draws information security concerns. The prevention of phishing website involves in various aspect, for example, user training, public awareness, fraudulent phishing, etc. However, recent phishing research has mainly focused on preventing fraudulent phishing and relied on manual identification that is inefficient for real-time detection systems. In this study, we used methods such as ANOVA, X², and information gain to evaluate features. Then, we filtered out the unrelated features and obtained the top 28 most related features as the features to use for the training and evaluation of traditional machine learning algorithms, such as Support Vector Machine (SVM) with linear or rbf kernels, Logistic Regression (LR), Decision tree, and K-Nearest Neighbor (KNN). This research also evaluated the above algorithms with the ensemble learning concept by combining multiple classifiers, such as Adaboost, bagging, and voting. Finally, the eXtreme Gradient Boosting (XGBoost) model exhibited the best performance of 99.2%, among the algorithms considered in this study.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.2018NTI0001/_p

Copy

@ARTICLE{e102-d_5_878,
author={Yu-Hung CHEN, Jiann-Liang CHEN, },
journal={IEICE TRANSACTIONS on Information},
title={AI@ntiPhish — Machine Learning Mechanisms for Cyber-Phishing Attack},
year={2019},
volume={E102-D},
number={5},
pages={878-887},
abstract={This study proposes a novel machine learning architecture and various learning algorithms to build-in anti-phishing services for avoiding cyber-phishing attack. For the rapid develop of information technology, hackers engage in cyber-phishing attack to steal important personal information, which draws information security concerns. The prevention of phishing website involves in various aspect, for example, user training, public awareness, fraudulent phishing, etc. However, recent phishing research has mainly focused on preventing fraudulent phishing and relied on manual identification that is inefficient for real-time detection systems. In this study, we used methods such as ANOVA, X², and information gain to evaluate features. Then, we filtered out the unrelated features and obtained the top 28 most related features as the features to use for the training and evaluation of traditional machine learning algorithms, such as Support Vector Machine (SVM) with linear or rbf kernels, Logistic Regression (LR), Decision tree, and K-Nearest Neighbor (KNN). This research also evaluated the above algorithms with the ensemble learning concept by combining multiple classifiers, such as Adaboost, bagging, and voting. Finally, the eXtreme Gradient Boosting (XGBoost) model exhibited the best performance of 99.2%, among the algorithms considered in this study.},
keywords={},
doi={10.1587/transinf.2018NTI0001},
ISSN={1745-1361},
month={May},}

Copy

TY - JOUR
TI - AI@ntiPhish — Machine Learning Mechanisms for Cyber-Phishing Attack
T2 - IEICE TRANSACTIONS on Information
SP - 878
EP - 887
AU - Yu-Hung CHEN
AU - Jiann-Liang CHEN
PY - 2019
DO - 10.1587/transinf.2018NTI0001
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E102-D
IS - 5
JA - IEICE TRANSACTIONS on Information
Y1 - May 2019
AB - This study proposes a novel machine learning architecture and various learning algorithms to build-in anti-phishing services for avoiding cyber-phishing attack. For the rapid develop of information technology, hackers engage in cyber-phishing attack to steal important personal information, which draws information security concerns. The prevention of phishing website involves in various aspect, for example, user training, public awareness, fraudulent phishing, etc. However, recent phishing research has mainly focused on preventing fraudulent phishing and relied on manual identification that is inefficient for real-time detection systems. In this study, we used methods such as ANOVA, X², and information gain to evaluate features. Then, we filtered out the unrelated features and obtained the top 28 most related features as the features to use for the training and evaluation of traditional machine learning algorithms, such as Support Vector Machine (SVM) with linear or rbf kernels, Logistic Regression (LR), Decision tree, and K-Nearest Neighbor (KNN). This research also evaluated the above algorithms with the ensemble learning concept by combining multiple classifiers, such as Adaboost, bagging, and voting. Finally, the eXtreme Gradient Boosting (XGBoost) model exhibited the best performance of 99.2%, among the algorithms considered in this study.
ER -

IEICE TRANSACTIONS on Information