A novel intrusion detection system based on feature generation with visualization strategy

doi:10.1016/j.eswa.2013.12.048

Expert Systems with Applications

Volume 41, Issue 9, July 2014, Pages 4139-4147

https://doi.org/10.1016/j.eswa.2013.12.048 Get rights and content

Highlights

•
This work reflects our research in feature selection with visualization strategy.
•
We design a four angle star and generate numerical features for data in KDDcup99.
•
We better generate numerical features beyond the traditional features.

Abstract

In this paper, a four-angle-star based visualized feature generation approach, FASVFG, is proposed to evaluate the distance between samples in a 5-class classification problem. Based on the four angle star image, numerical features are generated for network visit data from KDDcup99, and an efficient intrusion detection system with less features is proposed. The FASVFG-based classifier achieves a high generalization accuracy of 94.3555% in validation experiment, and the average Mathews correlation coefficient reaches 0.8858.

Introduction

Currently the quick developing Internet has brought great convenience for people globally to share the information. However, owing to the explosive use of networks, network attack becomes an urgent security issues (Liao, Lin, Lin, & Tung, 2013). After a Denial-of-service (DOS) attack upon American Yahoo that caused a halt of server and resulted in inestimable economic losses, investigator reported that the number of illegal code signature increased more than 256% over the previous year (Tjhai, Furnell, Papadaki, & Clarke, 2010).

To supervise user’s network connection and prevent malicious attacks, a firewall setting was the initial option. Unfortunately, due to the weakness of connection analysis, firework could hardly discern all of the malicious attacks. Therefore, considerable attentions were paid to intrusion detection system (IDS), which is designed to classify user’s activity as either normal or anomalous behavior by examining the dynamic characteristics of network connection records (Panda, Abraham, & Patra, 2012) and has been an essential part of a network security architecture nowadays (Tjhai et al., 2010). In general, architecture of IDS is divided into two categories in terms of analysis method: the anomaly detection and the misuse detection (Mukherjee & Sharma, 2012). While misuse detection aims to detect intrusions through established patterns of well-known attacks, anomaly detection intends to assort visitors into either legal or illegal users by comparing visitors’ behaviors in an established profile that contains historical normal behaviors data. However, despite the fact that anomaly detection achieves high performance of detecting new attacks, it leads high misjudgment rate as well. Meanwhile, misuse detection performs less ideal in identifying the unknown attacks even though its misjudgment rate is low.

For the purpose of enhancing detection precision and detection stability, various artificial intelligence algorithms are investigated to improve IDS, such as fuzzy logic (Kumar & Selvakumar, 2013), K-nearest neighbor (KNN) (Tsai and Lin, 2010; Su, 2011), support vector machine (SVM) (Joseph et al., 2010, Mohammed and Sulaiman, 2012, Seongjun et al., 2013), artificial neural networks (ANN) (Wang, Hao, Ma, & Huang, 2010), Naïve Bayes networks (Koc et al., 2012, Mukherjee and Sharma, 2012, Sanjai and Gao., 2014), decision tree (Gisung et al., 2014, Sindhu et al., 2012), genetic algorithm (GA) (Abadeh et al., 2011, Amin and Radu, 2013), self-organizing maps (SOM) (Tjhai et al., 2010), Markov chains (Seongjun et al., 2013), Cost matrix (Aikaterini & Christos, 2013), rough set (Nandita, Jaydeep, Jaya, & Moumita, 2013), ant colony algorithm (Feng, Zhang, Hu, & Huang, 2013) and principle component analysis (PCA) (Arunna, Tan, He, Priyadarsi, & Liu, 2013). These artificial intelligent algorithms usually offer an automatic mechanism and enhance the performance of IDSs. Among most mechanisms, feature selection is always a essential strategy which aims to decrease the training- and predicting-time, deal with data redundancy and irrelevancy, and finally enhance the IDS system.

For classification problem with high dimensional data, the purpose of feature selection is to decrease the computational time of the classification model and enhance the classification performance through removing redundant and irrelevant attributes. Generally, its significance could be shown in two respects: (a) Removing irrelevant and redundant features as well as filtering out noise. (b) Optimizing the procedure of finding a subset of features to a proposed desirable approach. Specifically, methods for feature selection can be divided into two categories: filter method and wrapper method (Li et al., 2012, Mukherjee and Sharma, 2012). Among the two methods, the former is developed to determine which features should be retained through analyzing the contribution of sole features to the classification performance, and the latter aims to select critical features by estimating the resultant probability of error after removing certain features (Li et al., 2012).

Moreover, both wrapper and filter method and hybrid feature selection method along with various artificial intelligence approaches are proposed to achieve better performance. For example, in 2012, Lin, Ying, Lee, and Lee (2012) selected 23 critical features in KDDcup99 data set through support vector machine (SVM) and simulated annealing (SA), and obtained 99.96% classification accuracy. In the same year, by applying gradually feature removal method (GFR) to IDS, Li et al. (2012) extracted 19 important features and achieved 98.6429% classification accuracy in 10-fold cross validation. Furthermore, by adopting triangle area based nearest neighbors (TANN) to IDS, Tsai and Lin (2010) succeeds in generating 10 triangle areas formed by the data and the 5-class cluster centers to replace the 41 original features and outperforming the other three algorithms, e.g., KNN, SVM and the combination of K-means and KNN.

In addition, in the time of explosive development of Big Data and storage capacity, the data analysis tasks are becoming increasingly difficult and challenging. Meanwhile, the amount and the complexity of the data available are also increased significantly. Due to the limitations in human’s cognitive and perceptual ability, it is necessary to adopt new ideas in data analysis so as to better cope with the massive knowledge discovery in Big Data. Under such circumstance, visualization is developed, which is a crucial component of research presentation and possesses two principal advantages: (a) Merging huge amounts of data into simple and effective graphics. (b) Providing efficient ways to analyze the information exist in the data sets in direct, easy-to-understand formats (Kelleher & Wagener, 2011). Besides, visualization serves two primary purposes: data analysis (Kelleher and Wagener, 2011, Shieh and Liao, 2012, Xu et al., 2010) and data presentation (Derick et al., 2013, Kelleher and Wagener, 2011).

Over the past decades, visualization has been developing at a startling speed. With the emergence of computers vision, visualization has been incorporated in lots of fields and promoted the understanding of complicated concepts and ideas. An illuminative example was given in Bioinformatics, when Parkinson and Blaxter (2003) applied visualization to model a 4-class classification and resolved a gene classification problem. By mapping a certain gene into a triangular phase space, whose three vertexes represent three selected distinct gene categories, the position of the gene within the phase space indicates its relationship to each of the three selected data sets and helps determine the class the data belongs to. In geological science, Xie and Seng (2012) adopted three-dimensional (3D) visualization to synthesize and process geological engineering data and proved that the application of 3D visualization technology is helpful in improving the utilization and management of the geological engineering data.

Inspired by the applications of visualization strategy, especially the one in Parkinson and Blaxter’s work (2003), visualization technology is considered to be applied in a new and novel intrusion detection system in this paper. We decrease the complexity of the data used in IDS and make the process of classification more intuitive by mapping the experimental data to a certain graph and generating new features to replace the original ones.

The purpose of this paper is to develop a novel intrusion detection system that combines the idea of feature generation and visualization technology. Basically, this work is a meaningful attempt that aims to adopt visualization strategy to achieve data presentation, feature selection, feature reduction, and classifier enhancement. We use a four star graph to give a intuitive simulation of high dimension data classification for IDS. Finally, generation of new visualized numerical features decrease the dimensionality of the data 41 to 16 or 4, and increase the computation speed of the new IDS. Visualization is an intuitive way for feature selection and feature reduction. For better understanding the enhanced performance, the proposed novel IDS is also compared with three other intrusion detection systems, and results show the proposed IDS outperforms the other three IDSs.

Section snippets

Data set

The KDDcup99 data set (downloaded from http://www.sigkdd.org/kddcup/index.php?section=1999&method=data) is a benchmark data set for intrusion detection. This data set consists of 9 weeks intrusion simulation in the US Air Force environment, and includes two versions, namely, the full data set (4,898,431 recordings, 18 M, 743 M Uncompressed), the 10% subset (494,307 recordings, 2.1 M, 75 M Uncompressed). For the sake of simplicity, the 10% subset is chosen as the experimental data set.

The whole

Evaluation criteria for prediction

Seven performance measurements to estimate the efficiency of classifier are applied in this research. The definitions are given as follows:

•
True positive (TP): The number of attack that correctly classified as attack.
•
True negative (TN): The number of normal that correctly detected as normal.
•
False positive (FP): The number of normal that falsely classified as attack, namely false alarm.
•
False negative (FN): The number of attack that falsely detected as normal.
•
Accuracy= $\frac{TP + TN}{TP + TN + FP + FN} \times 100 % .$
•
MCC is

Conclusion

In this paper, a visualized feature generalization approach is proposed to intrusion detection, and the flowchart is shown in Fig. 6. After the raw data is reduced from 494,307 to 145,831 by deleting repetitions, 17,480 data is randomly chosen as the experimental data, of which the first 1748 data is considered as the re-substitution experiment data and the whole 17,480 data is treated as the generalization experiment data. By using the four angle star to simulate the distance between an

Acknowledgements

This paper is partly supported by the National Creative Innovation Plan of College Students of China. R. R. (Grant No. 201210504090), National Natural Science Foundation of China (Grant No. 61202305), and project 2013PY120 supported by the fundamental research funds for the central universities.

References (31)

M.S. Abadeh et al.
Design and analysis of genetic fuzzy systems for intrusion detection in computer networks
Expert Systems with Applications
(2011)
J.F.C. Joseph et al.
CARRADS: Cross layer based adaptive real-time routing attack detection system for MANETS
Computer Networks
(2010)
C. Kelleher et al.
Ten guidelines for effective data visualization in scientific publication
Environmental Modeling & Software
(2011)
L. Koc et al.
A network intrusion detection system based on a Hidden Naïve Bayes multiclass classifier
Expert Systems with Applications
(2012)
Y. Li et al.
An efficient intrusion detection system based on support vector machines and gradually feature removal method
Expert Systems with Applications
(2012)
H.J. Liao et al.
Intrusion detection system: A comprehensive review
Journal of Network and Computer Application
(2013)
S.W. Lin et al.
An intelligent algorithm with feature selection and decision rules applied to anomaly intrusion detection
Applied Soft Computing
(2012)
M.N. Mohammed et al.
Intrusion Detection System Based on SVM for WLAN
Procedia Technology
(2012)
S. Mukherjee et al.
Intrusion detection using Naïve Bayes classifier with feature reduction
Procedia Technology
(2012)
M. Panda et al.
A hybrid intelligent approach for network intrusion detection
Procedia Engineering
(2012)

S.L. Shieh et al.

A new approach for data clustering and visualization using self-organizing maps

Expert Systems with Applications

(2012)

M.Y. Su

Real-time anomaly detection systems for Denial-of-Service attacks by weighted k-nearest-neighbor classifiers

(2011)

G.C. Tjhai et al.

A preliminary two-stage alarm correlation and filtering system using SOM neutral network and K-means algorithm

Computers & Security

(2010)

C.F. Tsai et al.

A triangle area based nearest neighbors approach to intrusion detection

Pattern Recognition

(2010)

G. Wang et al.

A new approach to intrusion detection using artificial neutral networks and fuzzy clustering

Expert Systems with Applications

(2010)

Cited by (73)

An effective intrusion detection approach using SVM with naïve Bayes feature embedding
2021, Computers and Security
Network security has become increasingly important in recent decades, while intrusion detection system plays a critical role in protecting it. Various machine learning techniques have been applied to intrusion detection, among which SVM has been considered as an effective method. However, existing studies rarely take the data quality into consideration, which is essential for constructing a well-performed intrusion detection system beyond machine learning techniques. In this paper, we propose an effective intrusion detection framework based on SVM with naïve Bayes feature embedding. Specifically, the naïve Bayes feature transformation technique is implemented on the original features to generate new data with high quality; then, an SVM classifier is trained using the transformed data to build the intrusion detection model. Experiments on multiple datasets in intrusion detection domain validate that the proposed detection method can achieve good and robust performances, with 93.75% accuracy on UNSW-NB15 dataset, 98.92% accuracy on CICIDS2017 dataset, 99.35% accuracy on NSL-KDD dataset and 98.58% accuracy on Kyoto 2006+ dataset. Furthermore, our method possesses huge advantages in terms of accuracy, detection rate and false alarm rate when compared to other methods.
Active learning to detect DDoS attack using ranked features
2019, Computer Communications
Network traffic classification to detect DDoS attacks is challenging in the context of high-speed networks. In this paper, we discuss the need for distributed feature selection in intrusion detection systems using parallel computing. This paper presents a parallel cumulative ranker algorithm to rank the attributes of a dataset for cost-effective classification of network traffic. We use MIT-DARPA, CAIDA, ISCX-IDS and TU-DDoS datasets to validate our method. Our feature ranking algorithm on large datasets (50,000-1,000,000 instances) finds best possible features from the above mentioned datasets and gives high accuracy (92%-97%) in a parallel environment, which takes significantly less time (71%-85% lower) than a sequential environment. We also discuss the importance of active learning to select appropriate instances by an expert module in an unsupervised way to train an SVM binary classifier for detection of DDoS attack traffic. Our approach selects small batches of training samples from a dataset to yield classification of network traffic with high accuracy. Our approach on large data provides better accuracy in classification with fewer training samples. A case study looks into the detection of intrusion in power systems.
A novel approach to intrusion detection using SVM ensemble with feature augmentation
2019, Computers and Security
Citation Excerpt :
Many measures are taken to protect networks from intrusions and attacks, including firewall, data encryption, intrusion detection system and other techniques. Among these, intrusion detection system (IDS) which aims to classify user’s activity into normal or intrusion-related behavior based on rules or models has received considerable attentions (Luo and Xia, 2014; Tjhai et al., 2010). Although promising improvements in IDSs detection approaches have been achieved, the intrusion detection is an ongoing research area where many problems need to be further solved, such as high dimensionality, huge volume, constantly changes in environments and real-time detection (Bamakan et al., 2016).
Network security has been a very important problem. Intrusion detection systems have been widely used to protect network security. Various machine learning techniques have been applied to improve the performance of intrusion detection systems, among which ensemble learning has received a growing interest and is considered as an effective method. Besides, the quality of training data is also an essential determinant that can greatly enhance the detection capability. Knowing that the marginal density ratios are the most powerful univariate classifiers. In this paper, we propose an effective intrusion detection framework based on SVM ensemble with feature augmentation. Specifically, the logarithm marginal density ratios transformation is implemented on the original features with the goal of obtaining new and better-quality transformed training data; SVM ensemble was then used to build the intrusion detection model. Experiment results show that our proposed method can achieve a good and robust performance, which possesses huge competitive advantages when compared to other existing methods in terms of accuracy, detection rate, false alarm rate and training speed.
Marine Goal Optimizer Tuned Deep BiLSTM-Based Self-Configuring Intrusion Detection in Cloud
2024, Journal of Grid Computing
Analysis of Wireless Sensor Network Security Models: A Salient Approach for Deeper Inspection Using Deep Neural Networks
2023, Proceedings of the 2023 International Conference on Emerging Techniques in Computational Intelligence, ICETCI 2023
Machine Learning–Enabled Security Parameter Selection to Identify Attacks on the Cloud and Host
2023, Integration of Cloud Computing with Emerging Technologies Issues, Challenges, and Practices

View all citing articles on Scopus

View full text

A novel intrusion detection system based on feature generation with visualization strategy

Highlights

Abstract

Introduction

Section snippets

Data set

Evaluation criteria for prediction

Conclusion

Acknowledgements

Expert Systems with Applications

Computer Networks

Environmental Modeling & Software

Expert Systems with Applications

Expert Systems with Applications

Journal of Network and Computer Application

Applied Soft Computing

Procedia Technology

Procedia Technology

Procedia Engineering

Expert Systems with Applications

Real-time anomaly detection systems for Denial-of-Service attacks by weighted k-nearest-neighbor classifiers

Computers & Security

Pattern Recognition

Expert Systems with Applications