A novel feature extraction methodology using Siamese convolutional neural networks for intrusion detection

Moustakidis, Serafeim; Karlsson, Patrik

doi:10.1186/s42400-020-00056-4

Cybersecurity

Table 3 Well known DR methods from the literature that were compared with the proposed Vec2im-Siam

From: A novel feature extraction methodology using Siamese convolutional neural networks for intrusion detection

DR approach	Acronym	Description
Infinite Latent Feature Selection (Roffo et al. 2017)	ILFS	A probabilistic latent feature selection approach that performs the ranking step by considering all the possible subsets of features bypassing the combinatorial problem
Unsupervised graph-based filter (Roffo et al. 2015)	Inf-FS	In Inf-FS, each feature is a node in the graph, a path is a selection of features, and the higher the centrality score, the most important the feature. It assigns a score of importance to each feature by taking into account all the possible feature subsets as paths on a graph.
Relief-F (Liu and Motoda 2007)	Relief-F	An iterative, randomized, and supervised approach that estimates the quality of features according to how well their values differentiate data samples that are near to each other; it does not discriminate among redundant features, and performance decreases with few data.
Laplacian Score (He et al. 2005)	LS	The importance of a feature is evaluated by its power of locality preserving. In order to model the local geometric structure, this method constructs a nearest neighbor graph. LS algorithm seeks those features that respect this graph structure.
Fisher filter feature selection (Gu et al. 2011, Xue-qin et al. 2006)	Fisher	It computes a score for a feature as the ratio of interclass separation and intraclass variance, where features are evaluated independently, and the final feature selection occurs by aggregating the m top ranked ones.
Correlation-based Feature Selection (Shahbaz et al. 2016)	CFS	CFS sorts features according to pairwise correlations
Unsupervised Feature Selection with Ordinal Locality (Guo et al. 2017)	UFSOL	A clustering-based approach that preserves the relative neighborhood proximities and contributes to distance-based clustering
Least Absolute Shrinkage and Selection Operator (Hagos et al. 2017)	Lasso	This method applies a regularization process that penalizes the coefficients of the regression variables while setting the less relevant to zero to respect the constraint on the sum. FS is a consequence of this process when all the variables that still have non-zero coefficients are selected to be part of the model
Chi-square feature selection (Thaseen and Kumar 2017; Thaseen et al. 2018)	Chi2	It ranks features based on the statistical significance test and consider only those features that are dependent on the class label
Minimum redundancy maximum relevance (Nguyen et al. 2010)	mRMR	A FS algorithm that systematically performs variable selection, achieving a reasonable trade-off between relevance and redundancy.
Fuzzy Complementarity Criterion (Moustakidis et al. 2012, Moustakidis and Theocharis 2010)	FuzCoC	FS is driven by a fuzzy complementary criterion which assures that features are iteratively introduced, providing the maximum additional contribution with regard to the information content given by the previously selected features

Back to article page