Using machine learning techniques to identify rare cyber‐attacks on the UNSW‐NB15 dataset

S Bagui, E Kalaimannan, S Bagui, D Nandi…�- Security and�…, 2019 - Wiley Online Library
Security and Privacy, 2019Wiley Online Library
This paper uses a hybrid feature selection process and classification techniques to classify
cyber‐attacks in the UNSW‐NB15 dataset. A combination of k‐means clustering, and a
correlation‐based feature selection, were used to come up with an optimum subset of
features and then two classification techniques, one probabilistic, Na�ve Bayes (NB), and a
second, based on decision trees (J48), were employed. Our results show that this hybrid
feature selection method in combination with the NB model was able to improve the�…
Abstract
This paper uses a hybrid feature selection process and classification techniques to classify cyber‐attacks in the UNSW‐NB15 dataset. A combination of k‐means clustering, and a correlation‐based feature selection, were used to come up with an optimum subset of features and then two classification techniques, one probabilistic, Na�ve Bayes (NB), and a second, based on decision trees (J48), were employed. Our results show that this hybrid feature selection method in combination with the NB model was able to improve the classification accuracy of most attacks, especially the rare attacks. The false alarm rates were lower for most of the attacks, and particularly the rare attacks, with this combination of feature selection and the NB model. The J48 decision tree model, however, did not perform any better with the feature selection, but its classification rate for all attack families was already very high, with or without feature selection.
Wiley Online Library
Showing the best result for this search. See all results